Enhancing specificity of analyte binding

ABSTRACT

Methods for enhancing specificity of an analyte binding moiety or probe oligonucleotide to an analyte are provided herein. For example, methods provided herein include blocking a capture binding domain, thereby preventing hybridization to the capture domain of the capture probe affixed to a substrate. Further methods include releasing the block from the capture binding domain, thereby allowing the capture binding domain to specifically bind to the capture domain of the capture probe on the substrate.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a divisional of U.S. Pat. Application 17/573,126, filed Jan. 11, 2022, which is a continuation of International Application PCT/US2020/059472, with an international filing date of Nov. 6, 2020, which claims priority to U.S. Provisional Pat. Application No. 63/040,292, filed Jun. 17, 2020, and U.S. Provisional Pat. Application No. 62/933,299, filed Nov. 8, 2019. The contents of each of these applications are incorporated herein by reference in their entireties.

BACKGROUND

Cells within a tissue of a subject have differences in cell morphology and/or function due to varied analyte levels (e.g., gene and/or protein expression) within the different cells. The specific position of a cell within a tissue (e.g., the cell’s position relative to neighboring cells or the cell’s position relative to the tissue microenvironment) can affect, e.g., the cell’s morphology, differentiation, fate, viability, proliferation, behavior, and signaling and crosstalk with other cells in the tissue.

Spatial heterogeneity has been previously studied using techniques that only provide data for a small handful of analytes in the context of an intact tissue or a portion of a tissue, or provide a lot of analyte data for single cells, but fail to provide information regarding the position of the single cell in a parent biological sample (e.g., tissue sample).

Increasing resolution of spatial heterogeneity can be achieved by increasing capture efficiency or by reducing background signal. This is usually achieved by relying on the affinity of the capture reagents and/or optimized reaction conditions, neither of which address methods with a second capture step, for example, when using antibody attached oligonucleotides to target an analyte. Therefore, there remains a need to develop strategies to enhance the specificity of binding to target analytes when a second capture step is involved.

SUMMARY

This disclosure features improvements to a spatial heterogeneity workflow that includes a step of reversibly blocking capture binding domains until a user prefers to expose it to a plurality of capture probes.

In one aspect, this disclosure features a method for identifying the location of an analyte in a biological sample comprising (a) contacting the biological sample with a substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) binding an analyte-binding moiety to the analyte, wherein the analyte-binding moiety is associated with an oligonucleotide comprising an analyte-binding-moiety barcode and a capture binding domain that hybridizes to the capture domain of the capture probe, wherein a portion of the capture binding domain is blocked; (c) releasing the blocking probe from the capture binding domain and hybridizing the capture binding domain to the capture domain; and (d) determining (i) all or a part of the sequence of the oligonucleotide bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the analyte in the biological sample. In some instances, the method includes further determining the abundance of the analyte in the biological sample. In some instances, the capture binding domain is blocked using a blocking probe. In other instances, the capture binding domain is blocked using artificial nucleic acids such as plurality of caged nucleotides in the capture binding domain itself.

In some instances, also disclosed herein is a method for enhancing the binding of a capture binding domain to a capture domain of a capture probe comprising: (a) contacting a biological sample with a substrate comprising a plurality of capture probes comprising a spatial barcode and the capture domain; (b) binding an analyte-binding moiety to an analyte, wherein the analyte-binding moiety is associated with an oligonucleotide comprising an analyte-binding-moiety barcode and the capture binding domain, wherein a portion of the capture binding domain is blocked; and (c) releasing the blocking probe from the capture binding domain, thereby allowing the capture binding domain to hybridize to the capture domain; thereby enhancing the binding of the capture binding domain to the capture domain. In some instances, the capture binding domain is blocked using a blocking probe. In other instances, the capture binding domain is blocked using artificial nucleic acids such as plurality of caged nucleotides in the capture binding domain itself. In some instances, the method further includes determining (i) all or a part of the sequence of the oligonucleotide specifically bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the analyte in the biological sample.

In one aspect, this disclosure features methods identifying the location and/or abundance of an analyte in a biological sample including: (a) contacting the biological sample with a substrate including a plurality of capture probes, wherein a capture probe of the plurality of capture probes includes a spatial barcode and a capture domain; (b) binding an analyte-binding moiety to the analyte, wherein the analyte-binding moiety is associated with an oligonucleotide including an analyte-binding-moiety barcode and a capture probe binding domain that hybridizes to the capture domain, wherein all or a portion of the capture probe binding domain is hybridized to a blocking probe; (c) releasing the blocking probe from the capture probe binding domain, thereby allowing the capture probe binding domain to hybridize to the capture domain; and (d) determining (i) all or a part of the sequence of the oligonucleotide specifically bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the analyte in the biological sample.

In another aspect, this disclosure features methods for enhancing the specificity of binding of a capture probe binding domain to a capture domain including: (a) contacting a biological sample with a substrate including a plurality of capture probes, wherein a capture probe of the plurality of capture probes includes a spatial barcode and the capture domain; (b) binding an analyte-binding moiety to the analyte, wherein the analyte-binding moiety is associated with an oligonucleotide including an analyte-binding-moiety barcode and the capture probe binding domain that hybridizes to the capture domain, wherein all or a portion of the capture probe binding domain is hybridized to a blocking probe; and (c) releasing the blocking probe from the capture probe binding domain, thereby allowing the capture probe binding domain to hybridize to the capture domain; thereby enhancing the specificity of binding of the capture probe binding domain to the capture domain.

In another aspect, this disclosure features methods for enhancing the specificity of binding of an analyte-binding moiety to a target analyte in a biological sample where the method includes: (a) contacting the biological sample with a substrate, where the substrate includes a plurality of capture probes affixed to the substrate, where the capture probe includes a spatial barcode and the capture domain; (b) binding an analyte-binding moiety to a target analyte in the biological sample, where the analyte-binding moiety is associated with an oligonucleotide including a capture probe binding domain that hybridizes to a capture domain of a capture probe, where the capture probe binding domain is blocked and prevented from hybridizing to the capture domain of the capture probe affixed to the substrate; (c) releasing the block from the capture probe binding domain, thereby allowing the capture probe binding domain to specifically bind to the capture domain of the capture probe on the substrate, thereby enhancing the specificity of binding of an analyte-binding moiety to a target analyte in a biological sample. In some embodiments, the method further includes determining (i) all or a part of the sequence of the oligonucleotide specifically bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the target analyte in the biological sample.

Also disclosed herein is a method for decreasing background binding on a substrate comprising: (a) contacting the biological sample with the substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) binding an analyte-binding moiety to the analyte, wherein the analyte-binding moiety is associated with an oligonucleotide comprising an analyte-binding-moiety barcode and a capture binding domain that hybridizes to the capture domain of the capture probe, wherein a portion of the capture binding domain is blocked; (c) releasing the blocking probe from the capture binding domain and hybridizing the capture binding domain to the capture domain; thereby decreasing background binding on a substrate. In some instances, the capture binding domain is blocked using a blocking probe. In other instances, the capture binding domain is blocked using artificial nucleic acids such as plurality of caged nucleotides in the capture binding domain itself.

In some embodiments, the analyte-binding moiety is a protein. In some embodiments, the protein is an antibody. In some embodiments, the antibody is a monoclonal antibody, recombinant antibody, synthetic antibody, a single domain antibody, a single-chain variable fragment (scFv), and or an antigen-binding fragment (Fab).

In some embodiments, the analyte-binding moiety is associated with the oligonucleotide via a linker. In some embodiments, the linker is a cleavable linker. In some embodiments, the cleavable linker is a photocleavable linker, UV-cleavable linker, or an enzyme-cleavable linker. In some embodiments, the cleavable linker is an enzyme cleavable linker.

In some embodiments, the method further includes releasing the oligonucleotide from the analyte-binding moiety. In some embodiments, the oligonucleotide includes a free 3′ end.

In another aspect, the disclosure includes a method for identifying the location of an analyte in a biological sample comprising (a) contacting the biological sample with a substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein the second probe oligonucleotide comprises a capture binding domain that hybridizes to the capture domain, wherein a portion of the capture probe domain is blocked; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating the ligated probe, wherein the ligated probe is substantially complementary to the analyte; and (d) releasing: (i) the ligated probe from the analyte, and (ii) the blocking probe from the capture binding domain, thereby allowing the capture binding domain to bind to the capture domain of the capture probe on the substrate; and (e) determining (i) all or a part of the sequence of the ligated probe bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the analyte in the biological sample. In some instances, the method includes further determining the abundance of the analyte in the biological sample. In some instances, the capture binding domain is blocked using a blocking probe. In other instances, the capture binding domain is blocked using artificial nucleic acids such as plurality of caged nucleotides in the capture binding domain itself.

In another aspect, the disclosure includes a method for enhancing the binding of a ligated probe to a capture domain comprising (a) contacting a biological sample with a substrate comprising a plurality of capture probes comprising a spatial barcode and the capture domain; (b) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to an analyte, wherein the second probe oligonucleotide comprises a capture binding domain that hybridizes to the capture domain, wherein a portion of the capture binding domain is blocked; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating the ligated probe, wherein the ligated probe is substantially complementary to the analyte; and (d) releasing: (i) the ligated probe from the analyte, and (ii) the blocking probe from the capture binding domain, thereby allowing the capture binding domain to bind to the capture domain of the capture probe on the substrate; thereby enhancing the binding of the ligated probed to the capture domain. In some instances, the method includes further determining the abundance of the analyte in the biological sample. In some instances, the capture binding domain is blocked using a blocking probe. In other instances, the capture binding domain is blocked using artificial nucleic acids such as plurality of caged nucleotides in the capture binding domain itself.

In some instances, the method further includes determining the abundance of the analyte in the biological sample. In some instances, the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to adjacent sequences of the analyte. In some instances, the method includes further determining (i) all or a part of the sequence of the ligated probe, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify the location of the analyte in the biological sample.

In another aspect, this disclosure features methods for identifying the location and/or abundance of an analyte in a biological sample including: (a) contacting the biological sample with a substrate including a plurality of capture probes, wherein a capture probe of the plurality of capture probes includes a spatial barcode and a capture domain; (b) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein: (i) the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to adjacent sequences of the analyte, and (ii) the second probe oligonucleotide includes a capture probe binding domain that hybridizes to the capture domain, wherein all or a portion of the capture probe binding domain is hybridized to a blocking probe, thereby preventing hybridization of the capture probe binding domain to the capture domain; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating the ligated probe, wherein the ligated probe is substantially complementary to the analyte; and (d) releasing: (i) the ligated probe from the analyte, and (ii) the blocking probe from the capture probe binding domain, thereby allowing the capture probe binding domain to specifically bind to the capture domain of the capture probe on the substrate; thereby enhancing the specificity of binding of a polynucleotide to the analyte in a biological sample.

In another aspect, this disclosure features methods for enhancing the specificity of binding of a ligated probe to a capture domain including: (a) contacting a biological sample with a substrate including a plurality of capture probes, wherein a capture probe of the plurality of capture probes includes a spatial barcode and the capture domain; (b) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein: (i) the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to adjacent sequences of the analyte, and (ii) the second probe oligonucleotide includes a capture probe binding domain that hybridizes to the capture domain, wherein all or a portion of the capture probe binding domain is hybridized to a blocking probe, thereby preventing hybridization of the capture probe binding domain to the capture domain; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating the ligated probe, wherein the ligated probe is substantially complementary to the analyte; and (d) releasing: (i) the ligated probe from the analyte, and (ii) the blocking probe from the capture probe binding domain, thereby allowing the capture probe binding domain to specifically bind to the capture domain of the capture probe on the substrate; thereby enhancing the specificity of binding of a polynucleotide to the analyte in a biological sample.

In some instances, also disclosed is a method for decreasing background binding on a substrate comprising: (a) contacting a biological sample with a substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to an analyte, wherein the second probe oligonucleotide comprises a capture binding domain that hybridizes to the capture domain, wherein a portion of the capture binding domain is hybridized to a blocking probe; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating the ligated probe, wherein the ligated probe is substantially complementary to the analyte; and (d) releasing: (i) the ligated probe from the analyte, and (ii) the blocking probe from the capture binding domain, thereby allowing the capture binding domain to bind to the capture domain of the capture probe on the substrate; thereby decreasing background binding on a substrate. In some instances, the capture binding domain is blocked using a blocking probe. In other instances, the capture binding domain is blocked using artificial nucleic acids such as plurality of caged nucleotides in the capture binding domain itself. In some instances, the methods further comprise determining (i) all or a part of the sequence of the oligonucleotide specifically bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the analyte in the biological sample.

In another aspect, this document features methods for enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample where the method includes: (a) contacting the biological sample with a substrate, where the substrate includes a plurality of capture probe affixed to the substrate, where the capture probe includes a spatial barcode and a capture domain; (b) binding a first analyte-binding moiety to a first target analyte in the biological sample, where the first analyte-binding moiety is associated with a first oligonucleotide including: (i) a first barcode that identifies the first analyte-binding moiety; and (ii) a first bridge sequence; (c) binding a second analyte-binding moiety to a second target analyte in the biological sample, where the second analyte-binding moiety is bound to a second oligonucleotide including: (i) a capture probe binding domain that binds to a capture domain on a capture probe, (ii) a second barcode that identifies second analyte-binding moiety; and (iii) a second bridge sequence; where the capture probe binding domain is blocked and prevented from hybridizing to the capture domain of the capture probe affixed to the substrate; (d) contacting the biological sample with a third oligonucleotide; (e) hybridizing the third oligonucleotide to the first bridge sequence of the first oligonucleotide and the second bridge sequence of the second oligonucleotide; (f) ligating the first oligonucleotide and the second oligonucleotide, thereby creating a ligated probe; and (g) releasing the block from the capture probe binding domain, thereby allowing the capture probe binding domain of the second oligonucleotide to specifically bind to the capture domain of the capture probe on the substrate; thereby enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample.

In some embodiments, in the methods disclosed herein, the first analyte-binding moiety is a first protein. In some embodiments, the first protein is a first antibody. In some embodiments, the first antibody is a monoclonal antibody, recombinant antibody, synthetic antibody, a single domain antibody, a single-chain variable fragment (scFv), and or an antigen-binding fragment (Fab). In some embodiments, the first analyte-binding moiety is associated with the first oligonucleotide via a first linker. In some embodiments, the first linker is a cleavable linker.

In some embodiments, the first cleavable linker is a photocleavable linker, UV-cleavable linker, or an enzyme-cleavable linker. In some embodiments, the first cleavable linker is an enzyme cleavable linker.

In some instances, the first probe oligonucleotide comprises at least two ribonucleic acid bases at the 3′ end, and wherein the second probe oligonucleotide comprises a phosphorylated nucleotide at the 5′ end. In some instances, the first probe oligonucleotide and the second probe oligonucleotide are DNA probes.

In some embodiments, the first oligonucleotide includes a free 3′ end. In some embodiments, the first oligonucleotide includes from 5′ to 3′: a first barcode and a first bridge sequence. In some embodiments, the first oligonucleotide includes from 5′ to 3′: a primer sequence, a first barcode, and a first bridge sequence.

In some embodiments, the second analyte-binding moiety is a second protein. In some embodiments, the second protein is a second antibody. In some embodiments, the second antibody is a monoclonal antibody, recombinant antibody, synthetic antibody, a single domain antibody, a single-chain variable fragment (scFv), and or an antigen-binding fragment (Fab).

In some embodiments, the second analyte-binding moiety is associated with the second oligonucleotide via a second linker. In some embodiments, the second linker is a second cleavable linker. In some embodiments, the second cleavable linker is a photocleavable linker, UV-cleavable linker, or an enzyme-cleavable linker. In some embodiments, the second cleavable linker is an enzyme cleavable linker.

In some embodiments, the second oligonucleotide includes a free 5′ end. In some embodiments, the second oligonucleotide includes a phosphorylated nucleotide at the 5′ end. In some embodiments, the second oligonucleotide includes from 3′ to 5′: a capture probe binding domain, a second barcode, and a second bridge sequence.

In some embodiments, the first analyte and the second analyte are the same. In some embodiments, the first analyte and the second analyte are different.

In some embodiments, the third oligonucleotide includes a sequence that is partially complementary to the first bridge sequence. In some embodiments, the third oligonucleotide includes a sequence that is partially complementary to the second bridge sequence.

In some embodiments, the ligation step includes ligating (i) the first oligonucleotide and the second oligonucleotide using enzymatic or chemical ligation. In some embodiments, the enzymatic ligation utilizes a ligase. In some embodiments, the ligase is one or more of a T4 RNA ligase (Rnl2), a SplintR ligase, a single stranded DNA ligase, or a T4 DNA ligase. In some embodiments, the ligase is a T4 RNA ligase 2 (Rnl2) ligase. In some embodiments, the method further includes releasing the ligated probe.

In some embodiments, the releasing includes removing the first oligonucleotide from the first analyte-binding moiety. In some embodiments, the releasing includes removing the second oligonucleotide from the second analyte-binding moiety.

In another aspect, this disclosure features methods for enhancing the specificity of binding of an oligonucleotide to a target analyte in a biological sample where the method includes: (a) contacting the biological sample with a substrate, where the substrate includes a plurality of capture probes affixed to the substrate, where the capture probe includes a spatial barcode and the capture domain; (b) binding a first probe oligonucleotide and a second probe oligonucleotide to a target analyte in the biological sample, where: the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to adjacent sequences of the analyte, the second probe oligonucleotide includes a capture probe binding domain that binds to a capture domain of a capture probe, where the capture probe binding domain is blocked and prevented from hybridizing to the capture domain of the capture probe affixed to the substrate; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating a ligated probe that is substantially complementary to the analyte; (d) releasing: the ligated probe from the analyte, and the block from the capture probe binding domain, thereby allowing the capture probe binding domain to specifically bind to the capture domain of the capture probe on the substrate, thereby enhancing the specificity of binding of a polynucleotide to a target analyte in a biological sample.

In some embodiments, the method further includes determining (i) all or a part of the sequence of the ligated probe, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify the location of the analyte in the biological sample.

In some embodiments, the first probe oligonucleotide includes at least two ribonucleic acid bases at the 3′ end.

In some embodiments, the first and second probe oligonucleotides are DNA probes.

In some embodiments, the first probe oligonucleotide further includes a functional sequence. In some embodiments, the functional sequence is a primer sequence.

In some embodiments, the second probe oligonucleotide includes a phosphorylated nucleotide at the 5′ end.

In some embodiments, the ligation step include ligating (i) the first probe oligonucleotide and the second probe oligonucleotide using enzymatic or chemical ligation. In some embodiments, the enzymatic ligation utilizes a ligase. In some embodiments, the ligase is one or more of a T4 RNA ligase (Rnl2), a SplintR ligase, a single stranded DNA ligase, or a T4 DNA ligase. In some embodiments, the ligase is a T4 RNA ligase 2 (Rnl2) ligase.

In some instances, the blocking probe and the capture binding domain are two different (noncontiguous or separate) nucleotide sequences, wherein the capture binding domain comprises a first sequence substantially complementary to the capture domain. In some instances, the capture binding domain comprises a contiguous sequence comprising (i) a first sequence substantially complementary to the capture domain and (ii) the blocking probe.

In some embodiments, the capture probe binding domain includes a contiguous sequence including a first sequence substantially complementary to the capture domain of the capture probe and a second sequence substantially complementary to the first sequence, where the second sequence blocks first sequence of the capture probe binding domain from hybridizing to the capture domain of the capture probe. In some embodiments, the second sequence includes a sequence configured to hybridize to the first sequence capture probe binding domain. In some embodiments, the second sequence includes a sequence that is substantially complementary to the first sequence of the capture probe binding domain. In some embodiments, the second sequence includes a homopolymeric sequence that is substantially complementary to the first sequence of the capture probe binding domain. In some embodiments, the second sequence is configured to hybridize to a poly(A), poly(T) or a poly-rU sequence. In some embodiments, the second sequence includes a poly(U) sequence. In some embodiments, the first sequence includes a homopolymeric sequence. In some embodiments, first sequence includes a poly(A) sequence or a poly(T) sequence.

In some embodiments, the capture probe binding domain further includes a hairpin sequence. In some embodiments, the hairpin sequence is located 5′ of the second sequence in the capture probe binding domain. In some embodiments, the capture probe binding domain includes from 5′ to 3′: a first sequence that is substantially complementary to the capture domain of the capture probe, a hairpin sequence, and a second sequence substantially complementary to the first sequence.

In some embodiments, the hairpin sequence include a sequence of about three nucleotides, about four nucleotides, about five nucleotides, about six nucleotides, about seven nucleotides, about eight nucleotides, about nine nucleotides or about 10 or more nucleotides. In some embodiments, the hairpin sequence includes DNA, RNA, DNA-RNA hybrid, or modified nucleotides.

In some embodiments, the hairpin sequence includes a cleavable linker. In some embodiments, the cleavable linker is a photocleavable linker, UV-cleavable linker, or an enzyme-cleavable linker. In some embodiments, the enzyme that cleaves that enzymatic-cleavable domain is an endonuclease. In some embodiments, where the hairpin sequence includes a target sequence for a restriction endonuclease.

In some embodiments, releasing the second sequence from the first sequence includes contacting with a restriction endonuclease. In some embodiments, releasing the second sequence from the first sequence includes contacting the second sequence with an endoribonuclease.

In some embodiments, the endoribonuclease is one or more of RNase H, RNase A, RNase C, or RNase I. In some embodiments, the endoribonuclease is RNAseH. In some embodiments, the RNase H includes RNase H1, RNase H2, or RNase H1 and RNase H2.

In some embodiments, the hairpin sequence includes a homopolymeric sequence. In some embodiments, the hairpin sequence includes a poly(T) or poly(U) sequence. In some embodiments, hairpin sequence includes a poly(U) sequence.

In some embodiments, releasing the blocking domain (i.e., also called a second sequence herein) includes contacting the hairpin sequence with a Uracil-Specific Excision Reagent (USER) enzyme. In some embodiments, releasing the second sequence includes denaturing the second sequence.

In some embodiments, the hairpin sequence further includes a sequence that is capable of binding to a capture domain.

In some embodiments, the cleavage of the hairpin results in a single stranded sequence that is capable of binding to a capture probe binding domain on a spatial array.

In some embodiments, the capture probe binding domain includes a plurality of caged nucleotides, where a caged nucleotide of the plurality of caged nucleotides includes a caged moiety that blocks the capture probe binding domain from hybridizing to the capture domain of the capture probe. In some embodiments, releasing the caged moiety from the nucleotide includes activating the caged moiety, thereby by allowing the capture probe binding domain to specifically bind to the capture domain of the capture probe. In some embodiments, the activating the caged moiety includes photolysis of the caged moiety from the nucleotide. In some embodiments, the activating includes exposing the caged moiety to light pulses, where the light is at a wavelength of about less than 360 nm. In some embodiments, the light at the wavelength of about 360 nm is produced by a UV light. In some embodiments, the UV light originates from a fluorescence microscope, a UV laser, or a UV flashlamp.

Also disclosed herein is a method for identifying the location of an analyte in a biological sample comprising: (a) contacting the biological sample with a substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) binding an analyte-binding moiety to the analyte, wherein the analyte-binding moiety is associated with an oligonucleotide comprising an analyte-binding-moiety barcode and a capture binding domain that hybridizes to the capture domain of the capture probe, wherein the capture binding domain comprises a plurality of caged nucleotides, wherein a caged nucleotide of the plurality of caged nucleotides comprises a caged moiety that blocks the capture binding domain from hybridizing to the capture domain; (c) releasing the caged moiety from the capture binding domain and hybridizing the capture binding domain to the capture domain; and (d) determining (i) all or a part of the sequence of the oligonucleotide bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the analyte in the biological sample.

In some instances, disclosed herein is a method for identifying the location of an analyte in a biological sample comprising: (a) contacting the biological sample with a substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein the second probe oligonucleotide comprises a capture binding domain that hybridizes to the capture domain, wherein the capture binding domain comprises a plurality of caged nucleotides, wherein a caged nucleotide of the plurality of caged nucleotides comprises a caged moiety that blocks the capture binding domain from hybridizing to the capture domain; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating the ligated probe, wherein the ligated probe is substantially complementary to the analyte; and (d) releasing: (i) the ligated probe from the analyte, and (ii) the caged moiety from the capture binding domain, thereby allowing the capture binding domain to bind to the capture domain of the capture probe on the substrate; and (e) determining (i) all or a part of the sequence of the ligated probe bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the analyte in the biological sample. In some instances, the method further includes determining the abundance of the analyte in the biological sample. In some instances, the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to adjacent sequences of the analyte.

In some embodiments, the caged nucleotide includes a caged moiety selected from the group consisting of 6-nitropiperonyloxymethy (NPOM), 1-(ortho-nitrophenyl)-ethyl (NPE), 2-(ortho-nitrophenyl) propyl (NPP), diethylaminocoumarin (DEACM), and nitrodibenzofuran (NDBF). In some embodiments, the caged nucleotide includes a non-naturally-occurring nucleotide selected from the group consisting of 6-nitropiperonyloxymethy (NPOM)-caged guanosine, 6-nitropiperonyloxymethy (NPOM)-caged uridine, and 6-nitropiperonyloxymethy (NPOM)-caged thymidine.

In some embodiments, the capture probe binding domain includes one caged nucleotide, two caged nucleotides, three caged nucleotides, four caged nucleotides, five caged nucleotides, six caged nucleotides, seven caged nucleotides, eight caged nucleotides, nine caged nucleotides, or ten or more caged nucleotides.

In some embodiments, the capture probe binding domain includes a caged nucleotide at the 3′ end. In some embodiments, the capture probe binding domain includes two caged nucleotides at the 3′ end. In some embodiments, the capture probe binding domain includes at least three caged nucleotides at the 3′ end. In some embodiments, the capture probe binding domain includes a caged nucleotide at the 5′ end. In some embodiments, the capture probe binding domain includes two caged nucleotides at the 5′ end. In some embodiments, the capture probe binding domain includes at least three caged nucleotides at the 5′ end. In some embodiments, the capture probe binding domain includes a caged nucleotide at every odd position starting at the 3′ end or starting at the 5′ end. In some embodiments, the capture probe binding domain includes a sequence including at least 10%, at least, 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, or at least 90% caged nucleotides.

In some embodiments, the analyte, the first analyte, or the second analyte is a protein. In some embodiments, the protein is an antibody. In some embodiments, the analyte is RNA. In some embodiments, the RNA is an mRNA.

In some instances, disclosed herein is a kit comprising: (a) a substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) a system comprising: (i) a plurality of an analyte binding moieties, wherein an analyte-binding moiety of the plurality is associated with an oligonucleotide comprising an analyte-binding-moiety barcode and the capture binding domain, or (ii) a plurality of first probe oligonucleotides and second probe oligonucleotides, wherein a first probe oligonucleotide and a second probe oligonucleotide each comprises sequences that are substantially complementary to an analyte, and wherein the second probe oligonucleotide comprises a capture binding domain; (c) a plurality of blocking probes; and (d) instructions for performing the method of any one of the preceding claims.

In some instances, also disclosed is a kit comprising: (a) a substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) a system comprising: (i) a plurality of an analyte binding moieties, wherein an analyte-binding moiety of the plurality is associated with an oligonucleotide comprising an analyte-binding-moiety barcode and the capture binding domain, wherein the capture binding domain comprises a plurality of caged nucleotides, wherein a caged nucleotide of the plurality of caged nucleotides comprises a caged moiety that blocks the capture binding domain from hybridizing to the capture domain, or (ii) a plurality of first probe oligonucleotides and second probe oligonucleotides, wherein a first probe oligonucleotide and a second probe oligonucleotide each comprises sequences that are substantially complementary to an analyte, and wherein the second probe oligonucleotide comprises a capture binding domain, wherein the capture binding domain comprises a plurality of caged nucleotides, wherein a caged nucleotide of the plurality of caged nucleotides comprises a caged moiety that blocks the capture binding domain from hybridizing to the capture domain; and (c) instructions for performing the method of any one of the preceding claims.

All publications, patents, patent applications, and information available on the internet and mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, patent application, or item of information was specifically and individually indicated to be incorporated by reference. To the extent publications, patents, patent applications, and items of information incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.

Where values are described in terms of ranges, it should be understood that the description includes the disclosure of all possible sub-ranges within such ranges, as well as specific numerical values that fall within such ranges irrespective of whether a specific numerical value or specific sub-range is expressly stated.

The term “each,” when used in reference to a collection of items, is intended to identify an individual item in the collection but does not necessarily refer to every item in the collection, unless expressly stated otherwise, or unless the context of the usage clearly indicates otherwise.

Various embodiments of the features of this disclosure are described herein. However, it should be understood that such embodiments are provided merely by way of example, and numerous variations, changes, and substitutions can occur to those skilled in the art without departing from the scope of this disclosure. It should also be understood that various alternatives to the specific embodiments described herein are also within the scope of this disclosure.

DESCRIPTION OF DRAWINGS

The following drawings illustrate certain embodiments of the features and advantages of this disclosure. These embodiments are not intended to limit the scope of the appended claims in any manner. Like reference symbols in the drawings indicate like elements.

FIG. 1 is a schematic diagram of an exemplary analyte capture agent.

FIG. 2 is a schematic diagram depicting an exemplary interaction between a feature-immobilized capture probe 224 and an analyte capture agent 226.

FIGS. 3A, 3B, and 3C are schematics illustrating how streptavidin cell tags can be utilized in an array-based system to produce a spatially-barcoded cells or cellular contents.

FIG. 4A is an exemplary schematic showing an analyte binding moiety comprising an oligonucleotide having a capture binding domain (indicated by a poly(A) sequence) that is hybridized to a blocking domain (indicated by a poly(T) sequence).

FIG. 4B is an exemplary schematic showing an oligonucleotide probe (RHS) from an RNA-templated ligation reaction. The probe includes a capture binding domain (indicated by a poly(A) sequence) hybridized to a blocking domain (indicated by a poly(T) sequence).

FIG. 5A is an exemplary schematic showing an analyte binding moiety that includes an oligonucleotide comprising a hairpin sequence disposed between a blocking domain (indicated by a poly(U) sequence) and a capture binding domain (indicated by a poly(A) sequence). As shown, the blocking domain hybridizes to the capture binding domain.

FIG. 5B is an exemplary schematic showing an oligonucleotide probe (RHS) from an RNA-templated ligation reaction. The probe includes an oligonucleotide comprising a hairpin sequence disposed between a blocking domain (indicated by a poly(U) sequence) and a capture binding domain (indicated by a poly(A) sequence). As shown, the blocking domain hybridizes to the capture binding domain.

FIGS. 6A and 6B are exemplary schematics showing a blocking domain released by RNAse H.

FIG. 7A is an exemplary schematic showing an analyte binding moiety that includes an oligonucleotide comprising a capture binding domain that is blocked using caged nucleotides (indicated by pentagons).

FIG. 7B is an exemplary schematic showing an oligonucleotide probe (RHS) from an RNA-templated ligation reaction. The probe comprises a capture binding domain that is blocked using caged nucleotides.

FIG. 7C is an exemplary schematic of an analyte binding moiety that includes a capture binding domain wherein the caged nucleotide are undergoing photolysis activation using light pulses.

FIG. 7D is an exemplary schematic showing an RHS probe that includes a capture binding domain wherein the caged nucleotides are undergoing photolysis activation using light pulses.

FIG. 8 is a schematic diagram showing an example of a barcoded capture probe, as described herein.

DETAILED DESCRIPTION I. Introduction

Spatial analysis methodologies can provide a vast amount of analyte level and/or expression data for a variety of multiple analytes within a sample at high spatial resolution, e.g., while retaining the native spatial context. Spatial analysis methods can include, e.g., the use of a capture probe including a spatial barcode (e.g., a nucleic acid sequence that provides information as to the position of the capture probe within a cell or a tissue sample (e.g., mammalian cell or a mammalian tissue sample) and a capture domain that is capable of binding to an analyte (e.g., a protein and/or nucleic acid) produced by and/or present in a cell.

Spatial analysis allows for the binding of analytes within a biological sample to capture domains. Oftentimes, both the analyte of interest and the capture domain of the capture probe are oligonucleotides. For example, an mRNA from a cell, a proxy of an mRNA, or another nucleic acid that can be used to identify the presence of the mRNA in a sample binds to a poly(T) sequence of a capture domain. By doing so, spatial analysis is able to determine where spatially that mRNA is being expressed in a sample and approximately how much of the mRNA is present at high resolution. Oftentimes there is background noise in such a system, for example background binding of the analyte, analyte proxy, or other analyte identifier to areas of the array, for example to capture domains that are not under the tissue but instead are on the array surface that is not covered by a tissue.

The disclosed methods provide solutions to the problem of background binding by inhibiting the interaction of the analytes with the capture domains and relieving that inhibition when analyte binding for spatial analysis is desired, thereby enhancing the specificity of the analytes binding in their spatial location.

Tissues and cells can be obtained from any source. For example, tissues and cells can be obtained from single-cell or multicellular organisms (e.g., a mammal). Tissues and cells obtained from a mammal, e.g., a human, often have varied analyte levels (e.g., gene and/or protein expression) which can result in differences in cell morphology and/or function. The position of a cell or a subset of cells (e.g., neighboring cells and/or non-neighboring cells) within a tissue can affect, e.g., the cell’s fate, behavior, morphology, and signaling and crosstalk with other cells in the tissue. Information regarding the differences in analyte levels (gene and/or protein expression) within different cells in a tissue of a mammal can also help physicians select or administer a treatment that will be effective and can allow researchers to identify and elucidate differences in cell morphology and/or cell function in the single-cell or multicellular organisms (e.g., a mammal) based on the detected differences in analyte levels within different cells in the tissue. Differences in analyte levels within different cells in a tissue of a mammal can also provide information on how tissues (e.g., healthy and diseased tissues) function and/or develop.

Non-limiting aspects of spatial analysis methodologies are described in WO 2011/127099, WO 2014/210233, WO 2014/210225, WO 2016/162309, WO 2018/091676, WO 2012/140224, WO 2014/060483, WO 2020/176788 U.S. Pat. No. 10,002,316, U.S. Pat. No. 9,727,810, U.S. Pat. Application Publication No. 2020/0277663, U.S. Pat. Application Publication No. 2017/0016053, Rodriques et al., Science 363(6434):1463-1467, 2019; WO 2018/045186, Lee et al., Nat. Protoc. 10(3):442-458, 2015; WO 2016/007839, WO 2018/045181, WO 2014/163886, Trejo et al., PLoS ONE 14(2):e0212031, 2019, U.S. Pat. Application Publication No. 2018/0245142, Chen et al., Science 348(6233):aaa6090, 2015, Gao et al., BMC Biol. 15:50, 2017, WO 2017/144338, WO 2018/107054, WO 2017/222453, WO 2019/068880, WO 2011/094669, U.S. Pat. No. 7,709,198, U.S. Pat. No. 8,604,182, U.S. Pat. No. 8,951,726, U.S. Pat. No. 9,783,841, U.S. Pat. No. 10,041,949, WO 2016/057552, WO 2017/147483, WO 2018/022809, WO 2016/166128, WO 2017/027367, WO 2017/027368, WO 2018/136856, WO 2019/075091, U.S. Pat. No. 10,059,990, WO 2018/057999, WO 2015/161173, and Gupta et al., Nature Biotechnol. 36:1197-1202, 2018, each of which is incorporated by reference in its entirety, and can be used herein in any combination. Further non-limiting aspects of spatial analysis methodologies are described herein.

Some general terminology that may be used in this disclosure can be found WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety. Typically, a “barcode” is a label, or identifier, that conveys or is capable of conveying information (about an analyte in a sample, a bead, and/or a capture probe. A barcode can be part of an analyte, or independent of an analyte. A barcode can be attached to an analyte. A particular barcode can be unique relative to other barcodes. For the purpose of this disclosure, an “analyte” can include any biological substance, structure, moiety, or component to be analyzed. The term “target” can similarly refer to an analyte of interest. Analytes can be broadly classified into one of two groups: nucleic acid analytes, and non-nucleic acid analytes. Examples of non-nucleic acid analytes include, but are not limited to, lipids, carbohydrates, peptides, proteins, glycoproteins (N-linked or O-linked), lipoproteins, phosphoproteins, specific phosphorylated or acetylated variants of proteins, amidation variants of proteins, hydroxylation variants of proteins, methylation variants of proteins, ubiquitylation variants of proteins, sulfation variants of proteins, viral coat proteins, extracellular and intracellular proteins, antibodies, and antigen binding fragments. In some embodiments, the analyte(s) can be localized to subcellular location(s), including, for example, organelles, e.g., mitochondria, Golgi apparatus, endoplasmic reticulum, chloroplasts, endocytic vesicles, exocytic vesicles, vacuoles, lysosomes, etc. In some embodiments, analyte(s) can be peptides or proteins, including without limitation antibodies and enzymes. Additional examples of analytes can be found in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

A “biological sample” is typically obtained from the subject for analysis using any of a variety of techniques including, but not limited to, biopsy, surgery, and laser capture microscopy (LCM), and generally includes cells and/or other biological material from the subject. In some embodiments, a biological sample can be a tissue section. In some embodiments, a biological sample can be a fixed and/or stained biological sample (e.g., a fixed and/or stained tissue section). In some embodiments biological sample can be a cell culture sample. In some embodiments, a biological sample can be nervous tissue, blood, serum, plasma, cerebrospinal fluid, or bone marrow aspirate. Biological samples are also described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

In addition, spatial analysis methods can be performed on various types of samples, including tissues (e.g., tissue slices) or single cells (e.g., cultured cells). Exemplary methods and compositions relating to tissue or single-cell spatial analysis is found at least in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety. In some instances, one biological sample can be used for tissue and single cell analysis. For example, multiple serial slices (e.g., 10 µm in thickness) of a tissue can be cut. A first slice can be placed on an array and analyte capture as described herein can be performed. In some instances, a second slice of tissue can further undergo cellular dissociation, creating a sample with isolated cells that can be analyzed using spatial analysis methods. Briefly, in some instances, a tissue is minced into small pieces and treated with lysis buffer to homogenize the sample. The homogenous resultant can be filtered and centrifuged to collect a pellet of nuclei. The nuclei can be resuspended and used for single cell analysis methods described herein. Data captured from the second slice (i.e., the single nuclei data) could then be combined with the data from the first slice (i.e., the whole tissue data) to gain a higher cell type understanding and potentially deconvolve the cell type identity within each spot on the array. Additional methods of single cell isolation is found in Hu et al., Mol Cell. 2017 Dec 7;68(5):1006-1015.e7; Habib et al., Science, 2016 Aug 26;353(6302):925-8; Habib et al., Nat Methods, 2017 Oct;14(10):955-958; Lake et al., Science, 2016 Jun 24;352(6293):1586-90; and Lacar et al., Nat Commun, 2016 Apr 19;7:11022; each of which is incorporated by reference in its entirety.

In another embodiment, two different samples are collected, whereby one sample is analyzed with intact tissue and a second tissue undergoes cell dissociation. Results from each biological sample can be compared to gain a higher cell type understanding and potentially deconvolve the cell type identity within each spot on the array. Array-based spatial analysis methods involve the transfer of one or more analytes from a biological sample to an array of features on a substrate, where each feature is associated with a unique spatial location on the array. Subsequent analysis of the transferred analytes includes determining the identity of the analytes and the spatial location of each analyte within the biological sample. The spatial location of each analyte within the biological sample is determined based on the feature to which each analyte is bound on the array, and the feature’s relative spatial location within the array.

A “capture probe” refers to any molecule capable of capturing (directly or indirectly) and/or labelling an analyte (e.g., an analyte of interest) in a biological sample. In some embodiments, the capture probe is a nucleic acid or a polypeptide. In some embodiments, the capture probe includes a barcode (e.g., a spatial barcode and/or a unique molecular identifier (UMI)) and a capture domain. In some instances, the capture probe can include functional sequences that are useful for subsequent processing. As exemplified in FIG. 8 , a capture probe 802 can be reversibly attached to a substrate 801 via a linker 803. The capture probe can include one or more functional sequences 804, which can include a sequencer specific flow cell attachment sequence, e.g., a P5 or P7 sequence, as well as functional sequence 806, which can include sequencing primer sequences, e.g., a R1 primer binding site, a R2 primer binding site. In some embodiments, sequence 804 is a P7 sequence and sequence 806 is a R2 primer binding site. A capture probe can additionally include a spatial barcode and/or unique molecular identifier 805 and a capture domain 807. The different sequences of the capture probe need not be in the sequential manner as depicted in this example, however the capture domain 807 should be placed in a location on the barcode wherein analyte capture and extension of the capture domain to create a copy of the analyte can occur. Additional features of capture probes are described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety. Generation of capture probes can be achieved by any appropriate method, including those described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

In some embodiments, more than one analyte type from a biological sample can be detected (e.g., simultaneously or sequentially) using any appropriate multiplexing technique, such as those described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

In some embodiments, detection of one or more analytes (e.g., protein analytes) can be performed using one or more analyte capture agents. As used herein, an “analyte capture agent” refers to an agent that interacts with an analyte (e.g., an analyte in a sample) and with a capture probe (e.g., a capture probe attached to a substrate) to identify the analyte. In some embodiments, the analyte capture agent includes an analyte binding moiety and a capture agent barcode domain. Additional description of analyte capture agents can be found in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

There are at least two general methods to associate a spatial barcode with one or more neighboring cells, such that the spatial barcode identifies the one or more cells, and/or contents of the one or more cells, as associated with a particular spatial location. One general method is to promote analytes out of a cell and towards a spatially-barcoded array (e.g., including spatially-barcoded capture probes). Another general method is to cleave spatially-barcoded capture probes from an array and promote the spatially-barcoded capture probes towards and/or into or onto the biological sample.

In some cases, capture probes may be configured to prime, replicate, and consequently yield optionally barcoded extension products from a template (e.g., a DNA or RNA template), or derivatives thereof (see, e.g., WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663 regarding extended capture probes, the references of which are incorporated by reference in their entireties.). For example, in some cases, the capture probes may include mRNA specific priming sequences, e.g., poly-T primer segments that allow priming and replication of mRNA in a reverse transcription reaction or other targeted priming sequences. Alternatively or additionally, random RNA priming may be carried out using random N-mer primer segments of the barcoded oligonucleotides. Reverse transcriptases (RTs) can use an RNA template and a primer complementary to the 3′ end of the RNA template to direct the synthesis of the first strand complementary DNA (cDNA). Additional variants of spatial analysis methods, including in some embodiments, an imaging step, are described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety. Analysis of captured analytes, for example, including sample removal, extension of capture probes, sequencing (e.g., of a cleaved extended capture probe and/or a cDNA molecule complementary to an extended capture probe), sequencing on the array (e.g., using in situ hybridization approaches), temporal analysis, and/or proximity capture, is described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663. Some quality control measures are described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

Typically, for spatial array-based analytical methods, a substrate functions as a support for direct or indirect attachment of capture probes to features of the array. A “feature” on an array acts as a locational support or repository for various molecular entities used in sample analysis. In some embodiments, some or all of the features in an array are functionalized for analyte capture. Exemplary substrates are described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety. Exemplary features and geometric attributes of an array can be found in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

Generally, analytes can be captured when contacting a biological sample with a substrate including capture probes (e.g., substrate with capture probes embedded, spotted, printed on the substrate or a substrate with features (e.g., beads, wells) comprising capture probes). As used herein, “contact,” “contacted,” and/ or “contacting,” a biological sample with a substrate refers to any contact (e.g., direct or indirect) such that capture probes can interact (e.g., bind covalently or non-covalently (e.g., hybridize)) with analytes from the biological sample. Capture can be achieved actively (e.g., using electrophoresis) or passively (e.g., using diffusion). Analyte capture is further described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

In some cases, a spatial analysis can be performed by attaching and/or introducing a molecule (e.g., a peptide, a lipid, or a nucleic acid molecule) having a barcode (e.g., a spatial barcode) to a biological sample (e.g., to a cell in a biological sample). In some embodiments, a plurality of molecules (e.g., a plurality of nucleic acid molecules) having a plurality of barcodes (e.g., a plurality of spatial barcodes) are introduced to a biological sample (e.g., to a plurality of cells in a biological sample) for use in spatial analysis. In some embodiments, after attaching and/or introducing a molecule having a barcode to a biological sample, the biological sample can be separated into single cells or cell groups for analysis. Some such methods of spatial analysis are described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

Some exemplary spatial analysis workflows are described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

In some embodiments, a spatial analysis can be performed using dedicated hardware and/or software, such as any of the systems described in WO 2020/176788 and/or U.S. Pat. Application Publication No. 2020/0277663, each of which is incorporated by reference in its entirety.

II. Methods and Compositions for Enhancing the Specificity of Target Analyte Capture

In some embodiments, provided herein are methods and materials for enhancing the specificity of binding of an antigen-binding moiety or a probe oligonucleotide to a target analyte in a biological sample by blocking the premature interactions between the antigen-binding moiety and/or probe oligonucleotide and a capture domain of a capture probe on a substrate. In instances without a blocking step, the capture binding domain (i.e., associated with the antigen-binding moiety or as a component of one of the probe oligonucleotides for RNA templated ligation) is capable of hybridizing to the capture domain of a probe affixed on a substrate instead of its target analyte thereby decreasing the sensitivity of an assay and creating background non-specific binding. For example, when using an antigen-binding moiety, the oligonucleotide of the moiety which comprises a capture domain complementary to the capture domain on the probe affixed to a substrate, if unblocked, can hybridize to the capture domain of the probe on a substrate surface before the antigen-binding moiety associates with its target analyte. In another example, for RNA-templated ligation an oligonucleotide probe used in RNA-templated ligation comprising a capture domain complementary to the capture domain on the probe affixed to a substrate can hybridize to the capture probe that is located on a substrate surface before (1) hybridization to an analyte of interest, (2) ligation of the oligonucleotides, or (3) both.

This disclosure describes solutions to these unwanted capture domain interactions by describing reversible methods that temporally block premature or unwanted hybridization of the capture binding domain of the analyte capture molecule and the capture probe on the surface of a substrate. The methods herein utilize one or more blocking probes to hybridize to a capture binding domain and temporarily block premature or unwanted hybridization. In some instances, the methods herein include steps of blocking a capture probe on a substrate. In some instances, the capture probe is blocked by any one of the blocking probes disclosed herein. In some instances, the capture probe is blocked at the capture domain. As used herein, a “blocking probe” is a molecule that affixes (e.g., hybridizes) to a capture binding domain and temporarily blocks premature or unwanted hybridization. A blocking probe can be a nucleic acid. In some instances, the blocking probe is a DNA molecule. In some instances, the blocking probe is an RNA molecule. In some instances, the blocking probe is a hybrid DNA-RNA molecule. The blocking probe can be synthetic or natural and as described herein, it can include one or more non-naturally occurring nucleotides. In some instances, the blocking probe includes full or partial (e.g., at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%) complementarity to the capture binding domain. In some instances, the blocking probe is at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least about 15, at least about 20, at least about 25, at least about 30, at least about 35, at least about 40, at least about 45, at least about 50, at least about 55, at least about 60, at least about 65, at least about 70, at least about 75, at least about 80, at least about 85, at least about 90, at least about 95, or at least about 100 nucleotides in length. As such, the methods disclosed herein increase the efficiency of target analyte hybridization compared to methods that do not temporally block the capture binding domain. Further, the efficiency of downstream analyses (e.g., amplification and sequencing) using the methods disclosed herein is increased compared to methods that do not temporally block the capture binding domain because fewer false positive sequences are identified and sequenced. The methods disclosed herein are applicable to methods of detecting proteins or nucleic acids. For example, with respect to protein analyte detection, provided herein are methods for enhancing the specificity of binding of an analyte-binding moiety to a target analyte in a biological sample, where the antigen-binding moiety is attached to an oligonucleotide having at least an analyte-binding-moiety barcode and a capture binding domain that is complementary to a capture probe, and where the capture binding domain is blocked from hybridizing to a capture binding domain using either caged nucleotides or by a sequence complementary to the capture binding domain. In some instances, the sequence complementary to the capture binding domain is on the same strand as the capture binding domain and folds to create a hairpin structure (see, e.g., FIGS. 5A and 5B). In some instances, the hairpin structure blocks access to the capture binding domain. With respect to nucleic acid analyte detection, also provided herein are methods for enhancing the specificity of binding of a probe oligonucleotide to a target analyte in a biological sample, where the probe oligonucleotide includes at least a sequence that specifically binds to a target analyte and a capture binding domain that is complementary to a capture probe, where the capture binding domain is blocked from hybridizing to a capture domain of the capture probe using either caged nucleotides or by a sequence complementary to the capture binding domain. By blocking the interaction of the capture binding domain with the capture domain of the capture probe on a substrate, the methods provided herein enable greater specificity in binding an antigen-binding moiety or a probe oligonucleotide to its target analyte and increase resolution of a spatial array by reducing background noise (e.g., non-specific binding). In addition, the disclosure provides methods for releasing the block, thereby providing temporal and spatial control over the interaction between an antigen-binding moiety and/or a probe oligonucleotide with a capture domain of a capture probe on a substrate at the time point when such binding occurs during a workflow.

This disclosure features methods wherein the specificity of binding of an oligonucleotide to a target analyte is increased by about 5%, by about 10%, by about 15%, by about 20%, by about 25%, by about 30%, by about 35%, by about 40%, by about 45%, by about 50%, by about 55%, by about 60%, by about 65%, by about 70%, by about 75%, by about 80%, by about 85%, by about 90%, by about 95%. In some embodiments, the methods increased specificity of binding of an oligonucleotide to a target analyte by about 1.5-fold, by about 2.0-fold, by about 2.5-fold, by about 3.0-fold, by about 3.5-fold, by about 4.0-fold, by about 4.5-fold, by about 5.0-fold, by about 6-fold, by about 7-fold, by about 8-fold, by about 9-fold, by about 10-fold, or more compared to a setting in which the capture binding domain is not blocked.

This disclosure features methods wherein the specificity of binding of a capture binding domain to a capture domain of a capture probe is increased by about 5%, by about 10%, by about 15%, by about 20%, by about 25%, by about 30%, by about 35%, by about 40%, by about 45%, by about 50%, by about 55%, by about 60%, by about 65%, by about 70%, by about 75%, by about 80%, by about 85%, by about 90%, by about 95%. In some embodiments, the methods increased specificity of binding of a capture binding domain to a capture domain of a capture probe by about 1.5-fold, by about 2.0-fold, by about 2.5-fold, by about 3.0-fold, by about 3.5-fold, by about 4.0-fold, by about 4.5-fold, by about 5.0-fold, by about 6-fold, by about 7-fold, by about 8-fold, by about 9-fold, by about 10-fold, or more compared to a setting in which the capture binding domain is not blocked.

(A) Blocking Probes and the Capture Binding Domain

In some instances, disclosed are blocking probes that hybridize to a capture binding domain. In some instances, the blocking probes are independent nucleotides from the capture binding domain. In some instances, the blocking probes and capture binding domain are on a contiguous nucleotide sequence and form a hairpin upon interaction.

In some embodiments, the capture binding domain can include a sequence that is at least partially complementary to a sequence of a capture domain of a capture probe (e.g., any of the exemplary capture domains described herein). FIG. 4A shows an exemplary capture binding domain attached to an analyte-binding moiety used to detect a protein in a biological sample. As show in FIG. 4A, an analyte-binding moiety 401 includes an oligonucleotide that includes a primer (e.g., a read2) sequence 402, an analyte-binding-moiety barcode 403, a capture binding domain having a first sequence (i.e., a capture binding domain) 404 (e.g., an exemplary poly A), and a blocking probe or second sequence 405 (e.g., poly T or poly U), where the blocking sequence blocks the capture binding domain from hybridizing to a capture domain on a capture probe. In some instances, the blocking sequence 405 is called a blocking probe as disclosed herein. In some instances, the blocking probe is a poly T sequence as exemplified in FIG. 4A.

FIG. 4B shows an exemplary capture binding domain attached to a probe oligonucleotide (e.g., a right hand (e.g., an RHS) probe). As shown in FIG. 4B, the methods include a probe oligonucleotide 406 (e.g., a second probe oligonucleotide that can be used in a RNA templated ligation as described below) that includes a first sequence (i.e., a capture binding domain) 404 (e.g., exemplary poly A) and a blocking sequence 405 (e.g., a poly T or a poly U), where the blocking sequence blocks the capture binding domain from hybridizing to a capture domain on a capture probe. In some instances, the blocking sequence 405 is called a blocking probe or second sequence as disclosed herein. In some instances, the blocking probe is a poly T sequence.

In some instances, as shown in FIGS. 4A and 4B, the blocking probe sequence is not on a contiguous sequence with the capture binding domain. In other words, in some instances, the capture binding domain (also herein called a first sequence) and the blocking sequence are independent polynucleotides. In some instances, it will be apparent to one skilled in the art that the terms “capture binding domain” and “first sequence” are used interchangeably in this disclosure.

In a non-limiting example, the first sequence can be a poly(A) sequence when the capture domain sequence of the capture probe on the substrate is a poly(T) sequence. In some embodiments, the capture binding domain includes a capture binding domain substantially complementary to the capture domain of the capture probe. By substantially complementary, it is meant that the first sequence of the capture binding domain is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% complementary to a sequence in the capture domain of the capture probe. In another example, the first sequence of the capture binding domain can be a random sequence (e.g., random hexamer) that is at least partially complementary to a capture domain sequence of the capture probe that is also a random sequence. In yet another example, a capture binding domain can be a mixture of a homopolymeric sequence (e.g., a poly(T) sequence) and a random sequence (e.g., random hexamer) when a capture domain sequence of the capture probe is also a sequence that includes a homopolymeric sequence (e.g., a poly(A) sequence) and a random sequence. In some embodiments, the capture binding domain includes ribonucleotides, deoxyribonucleotides, and/or synthetic nucleotides that are capable of participating in Watson-Crick type or analogous base pair interactions. In some embodiments, the first sequence of the capture binding domain sequence includes at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 21 nucleotides, at least 22 nucleotides, at least 23 nucleotides, or at least 24 nucleotides. In some embodiments, the first sequence of the capture binding domain includes at least 25 nucleotides, at least 30 nucleotides, or at least 35 nucleotides.

In some embodiments, the capture binding domain (i.e., the first sequence) and the blocking probe (i.e., the second sequence) of the capture binding domain are located on the same contiguous nucleic acid sequence. Where the capture binding domain and the blocking probe are located on the same contiguous nucleic acid sequence, the second sequence (e.g., a blocking probe) is located 3′ of the first sequence. Where the first sequence and the second sequence (e.g., a blocking probe) of the capture binding domain are located on the same contiguous nucleic acid sequence, the second sequence (e.g., the blocking probe) is located 5′ of the first sequence. As used herein, the terms second sequence and blocking probe are used interchangeably.

In some instances, the second sequence (e.g., the blocking probe) of the capture binding domain includes a nucleic acid sequence. In some instances, the second sequence is also called a blocking probe or blocking domain, and each term is used interchangeably. In some instances, the blocking domain is a DNA oligonucleotide. In some instances, the blocking domain is an RNA oligonucleotide. In some embodiments, a blocking probe of the capture binding domain includes a sequence that is complementary or substantially complementary to a first sequence of the capture binding domain. In some embodiments, the blocking probe prevents the first sequence of the capture binding domain from binding the capture domain of the capture probe when present. In some embodiments, the blocking probe is removed prior to binding the first sequence of the capture binding domain (e.g., present in a ligated probe) to a capture domain on a capture probe. In some embodiments, a blocking probe of the capture binding domain includes a poly-uridine sequence, a poly-thymidine sequence, or both. In some instances, the blocking probe (or the second sequence) is part of a hairpin structure that specifically binds to a capture binding domain and prevents the capture binding domain from hybridizing to a capture domain of a capture probe. See e.g., FIGS. 5A and 5B.

In some embodiments, the second sequence (e.g., the blocking probe) of the capture binding domain includes a sequence configured to hybridize to the first sequence of the capture binding domain. When the blocking probe is hybridized to the first sequence, the first sequence is blocked from hybridizing with a capture domain of a capture probe. In some embodiments, the blocking probe includes a sequence that is complementary to the first sequence. In some embodiments, the blocking probe includes a sequence that is substantially complementary to the first sequence. In some embodiments, the blocking probe includes a sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% complementary to the first sequence of the capture binding domain.

In some embodiments, the blocking probe of the capture binding domain includes a homopolymeric sequence that is substantially complementary to the first sequence of the capture binding domain. In some embodiments, the blocking probe is configured to hybridize to a poly(A), poly(T), or a poly-rU sequence. In some embodiments, the blocking probe includes a poly(A), poly(T), or a poly(U) sequence. In some embodiments, the first sequence includes a homopolymeric sequence. In some embodiments, the first sequence includes a poly(A), poly(U), or a poly(T) sequence.

In some embodiments, the capture binding domain further includes a hairpin sequence (as shown in FIGS. 5A and 5B). FIG. 5A shows an exemplary capture binding domain attached to an analyte-binding moiety used to detect a protein in a biological sample. As show in FIG. 5A, an analyte-binding moiety 501 includes an oligonucleotide that includes a primer (e.g., a read2) sequence 502, an analyte-binding-moiety barcode 503, a capture binding domain having a first sequence 504 (e.g., an exemplary poly A), a blocking probe 505 and a third sequence 506, wherein the second and/or third sequence can be poly T or poly U or a combination thereof, where the blocking probe creates a hairpin type structure and the third sequence blocks the first sequence from hybridizing to a capture domain on a capture probe. In some instances, the third sequence 506 is called a blocking. Further, 508 exemplifies a nuclease capable of digesting the blocking sequencing. In this example, 508 could be an endonuclease or mixture of nucleases capable of digesting uracils, such as UDG or a uracil specific excision mix such as USER (NEB).

FIG. 5B shows an exemplary capture binding domain attached to a probe oligonucleotide (e.g., a right hand (e.g., an RHS) probe). As shown in FIG. 5B, the methods include a probe oligonucleotide 507 (e.g., a second probe oligonucleotide that can be used in a RNA templated ligation as described below) that includes a first sequence 504 (e.g., exemplary poly A), a blocking probe 505 and a third sequence 506 wherein the second and/or third sequence can be poly T or a poly U or a combination thereof, where the blocking probe creates a hairpin type structure and the third sequence blocks the first sequence from hybridizing to a capture domain on a capture probe. In some instances, the third sequence 506 is called a blocking probe. In this example. In this example, 508 could be an endonuclease or mixture of nucleases capable of digesting uracils, such as UDG or a uracil specific excision mix such as USER (NEB).

Another embodiment of a hairpin blocker scenario is exemplified in FIGS. 6A and 6B. As exemplified in FIG. 6A, an analyte-binding moiety 601 includes an oligonucleotide that includes a primer (e.g., a read2) sequence 602, an analyte-binding-moiety barcode 603, a capture binding domain having a first sequence (i.e., a capture binding domain) 604 (e.g., an exemplary poly A), a second hairpin sequence 605 and a third sequence 606, where the third sequence (i.e., a blocking probe) blocks the first sequence from hybridizing to a capture domain on a capture probe. In this example, 608 exemplifies an RNase H nuclease capable of digesting the uracil blocking sequencing from the DNA:RNA hybrid that is formed by blocking of the first sequence with a uracil containing third sequence.

As exemplified in FIG. 6B, the methods include a probe oligonucleotide 607 (e.g., a second probe oligonucleotide that can be used in a RNA templated ligation as described below) that includes a first sequence 604 (e.g., exemplary poly A), a second hairpin sequence 605 and a third sequence 606 where the third sequence blocks the first sequence from hybridizing to a capture domain on a capture probe. In this example, 608 exemplifies an RNase H nuclease capable of digesting the uracil blocking sequencing from the DNA:RNA hybrid that is formed by blocking of the first sequence with a uracil containing third sequence.

In some embodiments, the hairpin sequence is located 5′ of the blocking probe in the capture binding domain. In some embodiments, the hairpin sequence is located 5′ of the first sequence in the capture binding domain. In some embodiments, the capture binding domain includes from 5′ to 3′ a first sequence substantially complementary to the capture domain of a capture probe, a hairpin sequence, and a blocking probe substantially complementary to the first sequence. Alternatively, the capture binding domain includes from 3′ to 5′ a first sequence substantially complementary to the capture domain of a capture probe, a hairpin sequence, and a blocking probe substantially complementary to the first sequence.

In some embodiments, the hairpin sequence includes a sequence of about three nucleotides, about four nucleotides, about five nucleotides, about six nucleotides, about seven nucleotides, about eight nucleotides, about nine nucleotides or about 10 or more nucleotides. In some instances, the hairpin is at least about 15 nucleotides, at least about 20 nucleotides, at least about 25 nucleotides, at least about 30 nucleotides, or more nucleotides.

In some embodiments, the hairpin sequence includes DNA, RNA, DNA-RNA hybrid, or includes modified nucleotides. In some instances, the hairpin is a poly(U) sequence. In some instances, the RNA hairpin sequence is digested by USER and/or RNAse H using methods disclosed herein. In some instances, the poly(U) hairpin sequence is digested by USER and/or RNAse H using methods disclosed herein. In some instances, the hairpin is a poly(T) sequence. It is appreciated that the sequence of the hairpin (whether it includes DNA, RNA, DNA-RNA hybrid, or includes modified nucleotides) can be nearly any nucleotide sequence so long as it forms a hairpin, and in some instances, so long as it is digested by USER and/or RNAse H.

In some embodiments, methods provided herein require that the second sequence (e.g., the blocking probe) of the capture binding domain that is hybridized to the first sequence of the capture binding domain is released from the first sequence. In some embodiments, releasing the blocking probe (or second sequence) from the first sequence is performed under conditions where the blocking probe de-hybridizes from the first sequence.

In some embodiments, releasing the blocking probe from the first sequence includes cleaving the hairpin sequence. In some embodiments, the hairpin sequence includes a cleavable linker. For example, the cleavable linker can be a photocleavable linker, UV-cleavable linker, or an enzyme-cleavable linker. In some embodiments, the enzyme that cleaves that enzymatic-cleavable domain is an endonuclease. In some embodiments, the hairpin sequence includes a target sequence for a restriction endonuclease.

In some embodiments, releasing the blocking probe (or the second sequence) of the capture binding domain that is hybridized to the first sequence of the capture binding domain includes contacting the blocking probe with a restriction endonuclease. In some embodiments, releasing the blocking probe from the first sequence includes contacting the blocking probe with an endoribonuclease. In some embodiments, when the blocking probe is an RNA sequence (e.g., a sequence comprising uracils) the endoribonuclease is one or more of RNase H, RNase A, RNase C, or RNase I. In some embodiments, wherein the endoribonuclease is RNase H. In some embodiments, the RNase H includes RNase H1, RNase H2, or RNase H1 and RNase H2.

In some embodiments, the hairpin sequence includes a homopolymeric sequence. In some embodiments, the hairpin sequence includes a poly(T) or poly(U) sequence. For example, the hairpin sequence includes a poly(U) sequence. In some embodiments, provided herein are methods for releasing the blocking probe by contacting the hairpin sequence with a Uracil-Specific Excision Reagent (USER) enzyme.

In some embodiments, releasing the blocking probe from the first sequence includes denaturing the blocking probe under conditions where the blocking probe de-hybridizes from the first sequence. In some embodiments, denaturing comprises using chemical denaturation or physical denaturation. For example, wherein physical denaturation (e.g., temperature) is used to release the blocking probe. In some embodiments, denaturing includes temperature modulation. For example, a first sequence and a blocking probe have predetermined annealing temperatures based on the composition (A, G, C, or T) within the known sequences. In some embodiments, the temperature is modulate up to 5° C., up to 10° C., up to 15° C., up to 20° C., up to 25° C., up to 30° C., or up to 35° C. above the predetermined annealing temperature. In some embodiments, the temperature is modulated at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or 35° C. above the predetermined annealing temperature. In some embodiments, once the temperature is modulated to a temperature above the predetermined annealing temperature, the temperature is cooled down to the predetermined annealing temperature at a ramp rate of about 0.1° C./second to about 1.0° C./second (e.g., about 0.1° C./second to about 0.9° C./second, about 0.1° C./second to about 0.8° C./second, about 0.1° C./second to about 0.7° C./second, about 0.1° C./second to about 0.6° C./second, about 0.1° C./second to about 0.5° C./second, about 0.1° C./second to about 0.4° C./second, about 0.1° C./second to about 0.3° C./second, about 0.1° C./second to about 0.2° C./second, about 0.2° C./second to about 1.0° C./second, about 0.2° C./second to about 0.9° C./second, about 0.2° C./second to about 0.8° C./second, about 0.2° C./second to about 0.7° C./second, about 0.2° C./second to about 0.6° C./second, about 0.2° C./second to about 0.5° C./second, about 0.2° C./second to about 0.4° C./second, about 0.2° C./second to about 0.3° C./second, about 0.3 to about 1.0° C./second, about 0.3° C./second to about 0.9° C./second, about 0.3° C./second to about 0.8° C./second, about 0.3° C./second to about 0.7° C./second, about 0.3° C./second to about 0.6° C./second, about 0.3° C./second to about 0.5° C./second, about 0.3° C./second to about 0.4° C./second, about 0.4° C./second to about 1.0° C./second, about 0.4° C./second to about 0.9° C./second, about 0.4° C./second to about 0.8° C./second, about 0.4° C./second to about 0.7° C./second, about 0.4° C./second to about 0.6° C./second, about 0.4° C./second to about 0.5° C./second, about 0.5° C./second to about 1.0° C./second, about 0.5° C./second to about 0.9° C./second, about 0.5° C./second to about 0.8° C./second, about 0.5° C./second to about 0.7° C./second, about 0.5° C./second to about 0.6° C./second, about 0.6° C./second to about 1.0° C./second, about 0.6° C./second to about 0.9° C./second, about 0.6° C./second to about 0.8° C./second, about 0.6° C./second to about 0.7° C./second, about 0.7° C./second to about 1.0° C./second, about 0.7° C./second to about 0.9° C./second, about 0.7° C./second to about 0.8° C./second, about 0.8° C./second to about 1.0° C./second, about 0.8° C./second to about 0.9° C./second, or about 0.9° C./second to about 1.0° C./second). In some embodiments, denaturing includes temperature cycling. In some embodiments, denaturing includes alternating between denaturing conditions (e.g., a denaturing temperature) and non-denaturing conditions (e.g., annealing temperature).

It is appreciated that, notwithstanding any particular function in an embodiment, the hairpin sequence can be any sequence configuration, so long as a hairpin is formed. Thus, in some instances, it could be, for example, a degenerate sequence, a random sequence, or otherwise (comprising any sequence of polynucleotides).

In some embodiments, the hairpin sequence further includes a sequence that is capable of binding to a capture domain of a capture probe. For example, releasing the hairpin sequence from the capture binding domain can require that the hairpin sequence is cleaved, where the portion of the hairpin sequence that is left following cleavage includes a sequence that is capable of binding to a capture domain of a capture probe. In some embodiments, all or a portion of the hairpin sequence is substantially complementary to a capture domain of a capture probe. In some embodiments, the sequence that is substantially complementary to a capture domain of a capture probe is located on the free 5′ or free 3′ end following cleavage of the hairpin sequence. In some embodiments, the cleavage of the hairpin results in a single stranded sequence that is capable of binding to a capture domain of a capture probe on a spatial array. While the release of a hairpin sequence may enable hybridization to a capture domain of a capture probe, it is contemplated that release of the hairpin would not significantly affect the capture of the target analyte by an analyte-binding moiety or a probe oligonucleotide (e.g., a second probe oligonucleotide).

(B) Capture Binding Domains Having a Plurality of Caged Nucleotides

In some instances, the one or more blocking methods disclosed herein include a plurality of caged nucleotides. In some embodiments, provided herein are methods where a capture binding domain includes a plurality of caged nucleotides. The caged nucleotides prevent the capture binding domain from interacting with the capture domain of the capture probe. The caged nucleotides include caged moieties that block Watson-Crick hydrogen bonding, thereby preventing interaction until activation, for example, through photolysis of the caged moiety that releases the caged moiety and restores the caged nucleotides ability to engage in Watson-Crick base pairing with a complement nucleotide.

FIGS. 7A-7B are demonstrative of blocking a capture binding domain with caged nucleotides. As exemplified in FIG. 7A, an analyte-binding moiety 701 includes an oligonucleotide that includes a primer (e.g., a read2) sequence 702, an analyte-binding-moiety barcode 703 and a capture binding domain having a sequence 704 (e.g., an exemplary polyA). Caged nucleotides 705 block the sequence 704, thereby blocking the interaction between the capture binding domain and the capture domain of the capture probe. The same mechanism is depicted in FIG. 7B, except for a probe oligonucleotide 707 (e.g., a second probe oligonucleotide that can be used in a RNA templated ligation as described below) that includes a first sequence 704 (e.g., exemplary poly A) blocked with caged nucleotides 705, thereby blocking the interaction between the capture binding domain and the capture domain of the capture probe.

FIGS. 7C- 7D demonstrate the release of the caged nucleotides 705 via, in this example, the application of light for photolysis and release of the caged nucleotides 705 from the capture binding domain 704, thereby allowing the capture binding domain to hybridize to the capture domain of the capture probe.

In some embodiments, the capture binding domain includes a plurality of caged nucleotides, wherein a caged nucleotide of the plurality of caged nucleotides includes a caged moiety that is capable of preventing interaction between the capture binding domain and the capture domain of the capture probe. Non-limiting examples of caged nucleotides, also known as light-sensitive oligonucleotides, are described in Liu et al., Acc. Chem. Res., 47(1): 45-55 (2014), which is incorporated by reference in its entirety. In some embodiments, the caged nucleotides include a caged moiety selected from the group of 6-nitropiperonyloxymethy (NPOM), 1-(ortho-nitrophenyl)-ethyl (NPE), 2-(ortho-nitrophenyl)propyl (NPP), diethylaminocoumarin (DEACM), and nitrodibenzofuran (NDBF).

In some embodiments, a caged nucleotide includes a non-naturally-occurring nucleotide selected from the group consisting of 6-nitropiperonyloxymethy (NPOM)-caged adenosine, 6-nitropiperonyloxymethy (NPOM)-caged guanosine, 6-nitropiperonyloxymethy (NPOM)-caged uridine, and 6-nitropiperonyloxymethy (NPOM)-caged thymidine. For example, the capture binding domain includes one or more caged nucleotides where the cage nucleotides include one or more 6-nitropiperonyloxymethy (NPOM)-caged guanosine. In another example, the capture binding domain includes one or more caged nucleotides where the cage nucleotides include one or more nitropiperonyloxymethy (NPOM)-caged uridine. In yet another example, the capture binding domain includes one or more caged nucleotides where the caged nucleotide includes one or more 6-nitropiperonyloxymethy (NPOM)-caged thymidine.

In some embodiments, the capture binding domain includes a combination of at least two or more of any of the caged nucleotides described herein. For example, the capture binding domain can include one or more 6-nitropiperonyloxymethy (NPOM)-caged guanosine and one or more nitropiperonyloxymethy (NPOM)-caged uridine. It is appreciated that a capture binding domain can include any combination of any of the caged nucleotides described herein.

In some embodiments, the capture binding domain includes one caged nucleotide, two caged nucleotides, three caged nucleotides, four caged nucleotides, five caged nucleotides, six caged nucleotides, seven caged nucleotides, eight caged nucleotides, nine caged nucleotides, or ten or more caged nucleotides.

In some embodiments, the capture binding domain includes a caged nucleotide at the 3′ end. In some embodiments, the capture binding domain includes two caged nucleotides at the 3′ end. In some embodiments, the capture binding domain includes at least three caged nucleotides at the 3′ end.

In some embodiments, the capture binding domain includes a caged nucleotide at the 5′ end. In some embodiments, the capture binding domain includes two caged nucleotides at the 5′ end. In some embodiments, the capture binding domain includes at least three caged nucleotides at the 5′ end.

In some embodiments, the capture binding domain includes a caged nucleotide at every odd position starting at the 3′ end of the capture binding domain. In some embodiments, the capture binding domain includes a caged nucleotide at every odd position starting at the 5′ end of the capture binding domain. In some embodiments, the capture binding domain includes a caged nucleotide at every even position starting at the 3′ end of the capture binding domain. In some embodiments, the capture binding domain includes a caged nucleotide at every even position starting at the 5′ end of the capture binding domain.

In some embodiments, the capture binding domain includes a sequence including at least 10%, at least, 20%, or at least 30% caged nucleotides. In some instances, the percentage of caged nucleotides in the capture binding domain is about 40%, about 50%, about 60%, about 70%, about 80% or higher. In some embodiments, the capture binding domain includes a sequence where every nucleotide is a caged nucleotide. It is understood that the limit of caged nucleotides is based on the sequence of the capture binding domain and on steric limitations of creating caged nucleotides in proximity to one another. Thus, in some instances, particular nucleotides (e.g., guanines) are replaced with caged nucleotides. In some instances, all guanines in a capture binding domain are replaced with caged nucleotides. In some instances, a fraction (e.g., about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, or about 95%) of guanines in a capture binding domain are replaced with caged nucleotides. In some instances, particular nucleotides (e.g., uridines or thymines) are replaced with caged nucleotides. In some instances, all uridines or thymines in a capture binding domain are replaced with caged nucleotides. In some instances, a fraction (e.g., about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, or about 95%) of uridines or thymines in a capture binding domain are replaced with caged nucleotides. Caged nucleotides are disclosed in Govan et al., Nucleic Acids Research (2013) 41; 22, 10518-10528, which is incorporated by reference in its entirety.

In some embodiments, the capture binding domain includes caged nucleotides that are evenly distributed throughout the capture binding domain. For example, a capture binding domain can include a sequence that includes at least 10% caged nucleotides where the caged nucleotides are evenly distributed throughout the capture binding domain. In some embodiments, the capture binding domain includes a sequence that is at least 10% caged nucleotides and where the 10% caged nucleotides are positioned at the 3′ of the capture binding domain. In some embodiments, the capture binding domain includes a sequence that is at least 10% caged nucleotides and where the 10% caged nucleotides are positioned at the 5′ end of the capture binding domain. In some embodiments, the caged nucleotides are included at every third, at every fourth, at every fifth, at every sixth nucleotide, or a combination thereof, of the capture binding domain sequence.

In some embodiments, provided herein are methods for releasing the caged moiety from the caged nucleotide. In some embodiments, releasing the caged moiety from the caged nucleotide includes activating the caged moiety. In some embodiments, releasing the caged moiety from the caged nucleotide restores the caged nucleotides ability to hybridize to a complementary nucleotide through Watson-Crick hydrogen bonding. For example, restoring the caged nucleotides ability to hybridize with a complementary nucleotide enables/restores the capture binding domain’s ability to interact with the capture domain. Upon releasing the caged moiety from the caged nucleotide, the caged nucleotide is no longer “caged” in that the caged moiety is no longer linked (e.g., either covalently or non-covalently) to the caged nucleotide. As used herein, the term “caged nucleotide” can refer to a nucleotide that is linked to a caged moiety or a nucleotide that was linked to a caged moiety but is no longer linked as a result of activation of the caged moiety.

In some embodiments, provided herein are methods for activating the caged moiety thereby releasing the caged moiety from the caged nucleotide. In some embodiments, activating the caged moiety includes photolysis of the caged moiety from the nucleotide. As used herein, “photolysis” can refer to the process of removing or separating a caged moiety from a caged nucleotide using light. In some embodiments, activating (e.g., photolysis) the caged moiety includes exposing the caged moiety to light pulses (e.g., two or more, three or more, four or more, or five or more pulses of light) that in total are sufficient to release the caged moiety from the caged nucleotide. In some embodiments, activating the caged moiety includes exposing the caged moiety to a light pulse (e.g., a single light pulse) that is sufficient to release the caged moiety from the caged nucleotide. In some embodiments, activating the caged moiety includes exposing the caged moiety to a plurality of pulses (e.g., one, or two or more pulses of light) where the light is at a wavelength of about less than about 360 nm. In some embodiments, the source of the light that is at a wavelength of about less than 360 nm is a UV light. The UV light can originate from a fluorescence microscope, a UV laser or a UV flashlamp, or any source of UV light known in the art.

In some embodiments, once the caged moiety is released from the capture binding domain, the oligonucleotide, probe oligonucleotide, or ligation product that includes the capture binding domain, is able to hybridize to the capture domain of the capture probe. Finally, to identify the location of the analyte or determine the interaction between two or more analyte-binding moieties, all or part of the sequence of the oligonucleotide, probe oligonucleotide, or ligation product, or a complement thereof, can be determined.

(C) Methods of Enhancing Detection of Protein Analytes (I) Analyte-binding Moieties

In some embodiments, provided herein are methods for enhancing the specificity of binding of an antigen-binding moiety to a target analyte in a biological sample where the method includes blocking the antigen-binding moiety from prematurely hybridizing with a capture probe affixed to a substrate. The antigen-binding moiety includes an oligonucleotide having at least an analyte-binding-moiety barcode and a capture binding domain where the capture binding domain can hybridize to a capture domain of a capture probe. In the method provided herein, the capture binding domain of the oligonucleotide is blocked and prevented from hybridizing to the capture domain of the capture probe on the substrate. Releasing the block from the capture binding domain enables the capture binding domain to specifically bind the capture domain of the capture probe on the substrate at the appropriate time in the workflow.

FIG. 1 exemplifies an analyte binding moiety. Briefly, an analyte binding moiety 102 including an analyte binding domain 104 which can bind to a target analyte 106. The analyte binding moiety further includes an oligonucleotide 108 with comprises an analyte binding moiety barcode and a capture binding domain which can hybridize to a capture domain of a capture probe.

FIG. 2 is a schematic diagram depicting an exemplary interaction between a feature-immobilized capture probe 224 and an analyte capture agent 226. The feature-immobilized capture probe 224 can include a spatial barcode 208 as well as one or more functional sequences 206 and 210, as described elsewhere herein. The capture probe can also include a capture domain 212 that is capable of binding to an analyte capture agent 226. The analyte capture agent 226 can include a functional sequence 218, capture agent barcode domain 216, and an analyte capture sequence 214 that is capable of binding to the capture domain 212 of the capture probe 224. The analyte capture agent can also include a linker 220 that allows the capture agent barcode domain 216 to couple to the analyte binding moiety 222.

FIGS. 3A, 3B, and 3C are schematics illustrating how streptavidin cell tags can be utilized in an array-based system to produce a spatially-barcoded cell or cellular contents. For example, as shown in FIG. 3A, peptide-bound major histocompatibility complex (MHC) can be individually associated with biotin (β2m) and bound to a streptavidin moiety such that the streptavidin moiety comprises multiple pMHC moieties. Each of these moieties can bind to a TCR such that the streptavidin binds to a target T-cell via multiple MCH/TCR binding interactions. Multiple interactions synergize and can substantially improve binding affinity. Such improved affinity can improve labelling of T-cells and also reduce the likelihood that labels will dissociate from T-cell surfaces. As shown in FIG. 3B, a capture agent barcode domain 301 can be modified with streptavidin 302 and contacted with multiple molecules of biotinylated MHC 303 such that the biotinylated MHC 303 molecules are coupled with the streptavidin conjugated capture agent barcode domain 301. The result is a barcoded MHC multimer complex 305. As shown in FIG. 3B, the capture agent barcode domain sequence 301 can identify the MHC as its associated label and also includes optional functional sequences such as sequences for hybridization with other oligonucleotides. As shown in FIG. 3C, one example oligonucleotide is capture probe 306 that comprises a complementary sequence (e.g., rGrGrG corresponding to C C C), a barcode sequence and other functional sequences, such as, for example, a UMI, an adapter sequence (e.g., comprising a sequencing primer sequence (e.g., R1 or a partial R1 (“pR1”), R2), a flow cell attachment sequence (e.g., P5 or P7 or partial sequences thereof)), etc. In some cases, capture probe 306 may at first be associated with a feature (e.g., a gel bead) and released from the feature. In other embodiments, capture probe 306 can hybridize with a capture agent barcode domain 301 of the MHC-oligonucleotide complex 305. The hybridized oligonucleotides (Spacer C C C and Spacer rGrGrG) can then be extended in primer extension reactions such that constructs comprising sequences that correspond to each of the two spatial barcode sequences (the spatial barcode associated with the capture probe, and the barcode associated with the MHC-oligonucleotide complex) are generated. In some cases, one or both of these corresponding sequences may be a complement of the original sequence in capture probe 306 or capture agent barcode domain 301. In other embodiments, the capture probe and the capture agent barcode domain are ligated together. The resulting constructs can be optionally further processed (e.g., to add any additional sequences and/or for clean-up) and subjected to sequencing. As described elsewhere herein, a sequence derived from the capture probe 306 spatial barcode sequence may be used to identify a feature and the sequence derived from spatial barcode sequence on the capture agent barcode domain 301 may be used to identify the particular peptide MHC complex 304 bound on the surface of the cell (e.g., when using MHC-peptide libraries for screening immune cells or immune cell populations).

In one feature of the disclosure, the method for enhancing the specificity of binding of an analyte-binding moiety to a target analyte includes (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probe includes a spatial barcode and the capture domain; (b) binding an analyte-binding moiety to a target analyte in the biological sample, wherein the analyte-binding moiety is associated with an oligonucleotide including a capture binding domain that hybridizes to a capture domain of a capture probe, wherein the capture binding domain is blocked and prevented from hybridizing to the capture domain of the capture probe affixed to the substrate; (c) releasing the block from the capture binding domain, thereby allowing the capture binding domain to bind to the capture domain of the capture probe on the substrate, thereby enhancing the specificity of binding of an analyte-binding moiety to a target analyte in a biological sample.

In another feature of the disclosure, the method for enhancing the specificity of binding of an analyte-binding moiety to a target analyte includes (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probe includes a spatial barcode and the capture domain; (b) binding an analyte-binding moiety to a target analyte in the biological sample, wherein the analyte-binding moiety is associated with an oligonucleotide including a capture binding domain that hybridizes to a capture domain of a capture probe, wherein the capture binding domain includes caged nucleotides, where a caged nucleotide of the plurality of caged nucleotides includes a caged moiety that blocks the capture binding domain from hybridizing to the capture domain of the capture probe affixed to the substrate; (c) releasing (e.g., through activation or photolysis) the caged moiety from the caged nucleotides of the plurality of caged nucleotides from the capture binding domain, and allowing the capture binding domain to bind to the capture domain of the capture probe on the substrate, thereby enhancing the specificity of binding of an analyte-binding moiety to a target analyte in a biological sample.

In another feature of the disclosure, the method for enhancing the specificity of binding of an analyte-binding moiety to a target analyte includes (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probes include a spatial barcode and the capture domain; (b) binding an analyte-binding moiety to a target analyte in the biological sample, wherein the analyte-binding moiety is associated with an oligonucleotide including a capture binding domain that hybridizes to a capture domain of a capture probe, wherein the capture binding domain includes a first sequence that hybridizes to the capture domain of the capture probe and a second sequence (e.g., a blocking probe) that hybridizes to the first sequence, wherein the second sequence (e.g., the blocking probe) blocks the first sequence from hybridizing to the capture domain of the capture probe affixed to the substrate; (c) releasing the second sequence (e.g., the blocking probe) from the first sequence of the capture binding domain and allowing the first sequence of the capture binding domain to bind to the capture domain of the capture probe on the substrate, thereby enhancing the specificity of binding of an analyte-binding moiety to a target analyte in a biological sample.

Also provided herein are methods for determining (i) all or a part of the sequence of the oligonucleotide specifically bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the target analyte in the biological sample.

In some embodiments, the oligonucleotide associated with the analyte-binding moiety includes a capture binding domain that hybridizes to a capture domain of a capture probe, where the capture binding domain includes a plurality of caged nucleotides, and where a caged nucleotide (e.g., any of the exemplary caged nucleotides described herein) of the plurality of caged nucleotides prevents hybridization between the capture binding domain and the capture domain of the capture probe.

In some embodiments, the oligonucleotide associated with the analyte-binding moiety includes an analyte-binding-moiety barcode (e.g., any of the exemplary an analyte-binding-moiety barcodes described herein). For example, the oligonucleotide associated with the analyte-binding moiety includes an analyte-binding-moiety barcode that identifies the particular analyte-binding moiety. In some embodiments where the method includes two or more analyte binding moieties that each bind to a different analyte, each analyte binding moiety is associated with an oligonucleotide that includes an analyte-binding-moiety barcode that enables differentiation between the two analyte-binding moieties. In some embodiments, the method identifying the location of the analyte in the biological sample includes determining all or part of the sequence of the oligonucleotide that is bound to the capture domain of the capture probe and includes an analyte-binding-moiety barcode.

In some embodiments, the oligonucleotide is a sequence of about 10 nucleotides to about 100 nucleotides (e.g., a sequence of about 10 nucleotides to about 90 nucleotides, about 10 nucleotides to about 80 nucleotides, about 10 nucleotides to about 70 nucleotides, about 10 nucleotides to about 60 nucleotides, about 10 nucleotides to about 50 nucleotides, about 10 nucleotides to about 40 nucleotides, about 10 nucleotides to about 30 nucleotides, about 10 nucleotides to about 20 nucleotides, about 20 nucleotides to about 90 nucleotides, about 20 nucleotides to about 80 nucleotides, about 20 nucleotides to about 70 nucleotides, about 20 nucleotides to about 60 nucleotides, about 20 nucleotides to about 50 nucleotides, about 20 nucleotides to about 40 nucleotides, about 20 nucleotides to about 30 nucleotides, about 30 nucleotides to about 90 nucleotides, about 30 nucleotides to about 80 nucleotides, about 30 nucleotides to about 70 nucleotides, about 30 nucleotides to about 60 nucleotides, about 30 nucleotides to about 50 nucleotides, about 30 nucleotides to about 40 nucleotides, about 40 nucleotides to about 90 nucleotides, about 40 nucleotides to about 80 nucleotides, about 40 nucleotides to about 70 nucleotides, about 40 nucleotides to about 60 nucleotides, about 40 nucleotides to about 50 nucleotides, about 50 nucleotides to about 90 nucleotides, about 50 nucleotides to about 80 nucleotides, about 50 nucleotides to about 70 nucleotides, about 50 nucleotides to about 60 nucleotides, about 60 nucleotides to about 90 nucleotides, about 60 nucleotides to about 80 nucleotides, about 60 nucleotides to about 70 nucleotides, about 70 nucleotides to about 90 nucleotides, about 70 nucleotides to about 80 nucleotides, or about 80 nucleotides to about 90 nucleotides).

In some embodiments, the analyte-binding moiety is a protein. In some embodiments, the protein is an antibody. For example, the antibody can be a monoclonal antibody, recombinant antibody, synthetic antibody, a single domain antibody, a single-chain variable fragment (scFv), and or an antigen-binding fragment (Fab).

In some embodiments, the analyte-binding moiety is associated with the oligonucleotide via a linker. In some embodiments, the linker is a cleavable linker. For example, the cleavable linker can be a photocleavable linker, UV-cleavable linker, or an enzyme-cleavable linker. In some embodiments, the cleavable linker is an enzyme cleavable linker. In some embodiments, the method also includes releasing the oligonucleotide from the analyte-binding moiety where the analyte-binding moiety is exposed to an enzyme that cleaves that cleavable linker thereby releasing the oligonucleotide. In some embodiments, releasing the oligonucleotide occurs prior to, contemporaneously with, or after releasing the block on the capture binding domain. In some embodiments, the oligonucleotide when associated with an analyte-binding moiety includes a free 3′ end. In some embodiments, the oligonucleotide when associated with an analyte-binding moiety includes a free 5′ end. In some embodiments, the analyte-binding moiety is as described herein.

In some embodiments, the analyte-binding moiety is a non-protein. For example, the analyte-binding moiety is an aptamer. In some embodiments, the analyte-binding moiety is a DNA aptamer. In some embodiments, the DNA aptamer is a single-stranded DNA molecule. In some embodiments, the analyte-binding moiety is an RNA aptamer. In some embodiments, the aptamer is synthetic. In some embodiments, the RNA aptamer is a single-stranded RNA molecule. In some embodiments, the aptamer binds to a specific target, including but not limited to, proteins, peptides, carbohydrates, small molecules, and toxins. In some embodiments, an aptamer as disclosed herein binds into its target specifically by folding into tertiary structures.

(Ii) Proximity Ligation Including a Blocking Domain

In some embodiments, provided herein are methods for enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample where the method includes binding a first analyte binding moiety including a first oligonucleotide to a first target analyte and a second analyte binding moiety including a second oligonucleotide to a second target analyte and generating a ligation product based on proximity ligation of the first oligonucleotide to the second nucleotide. One of the oligonucleotides that is included in the ligation product includes a capture binding domain that is blocked and prevented from hybridizing to a capture domain of a capture probe. Releasing the block from the capture binding enables the ligation product to specifically bind via the capture binding domain to the capture domain of the capture probe on the substrate. In some embodiments, the ligation product generated from the ligation of (i) a first oligonucleotide bound to a first analyte-binding moiety to (ii) a second oligonucleotide bound to a second analyte-binding moiety based on the proximity of the first and second analyte-binding moieties is used to determine a location of least one analyte in a biological sample. In some embodiments, because the capture binding domain is prevented from hybridizing with the capture domain of the capture probe, inadvertent capturing of the analyte-binding moiety can be minimized, thereby reducing background signal on the substrate.

In another feature of the disclosure, the method for enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample includes: (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probe includes a spatial barcode and a capture domain; (b) binding a first analyte-binding moiety to a first target analyte in the biological sample, wherein the first analyte-binding moiety is associated with a first oligonucleotide including: (i) a first barcode that identifies the first analyte-binding moiety; and (ii) a first bridge sequence; (c) binding a second analyte-binding moiety to a second target analyte in the biological sample, wherein the second analyte-binding moiety is bound to a second oligonucleotide including:(i) a capture binding domain that binds to a capture domain on a capture probe,(ii) a second barcode that identifies the second analyte-binding moiety; and (iii) a second bridge sequence; wherein the capture binding domain is blocked and prevented from hybridizing to the capture domain of the capture probe affixed to the substrate; (d) contacting the biological sample with a third oligonucleotide; (e) hybridizing the third oligonucleotide to the first bridge sequence of the first oligonucleotide and the second bridge sequence of the second oligonucleotide; (f) ligating the first oligonucleotide and the second oligonucleotide, thereby creating a ligated probe; (g) releasing the block from the capture binding domain, thereby allowing the capture binding domain of the second oligonucleotide to bind to the capture domain of the capture probe on the substrate; thereby enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample.

In another feature of the disclosure, the method for enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample includes: (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probe includes a spatial barcode and a capture domain; (b) binding a first analyte-binding moiety to a first target analyte in the biological sample, wherein the first analyte-binding moiety is associated with a first oligonucleotide including: (i) a first barcode that identifies the first analyte-binding moiety; and (ii) a first bridge sequence; (c) binding a second analyte-binding moiety to a second target analyte in the biological sample, wherein the second analyte-binding moiety is bound to a second oligonucleotide including:(i) a capture binding domain that binds to a capture domain on a capture probe, (ii) a second barcode that identifies the second analyte-binding moiety; and (iii) a second bridge sequence; wherein the capture binding domain includes a plurality of caged nucleotides, where a caged nucleotide of the plurality of caged nucleotides includes a caged moiety that blocks hybridization between the capture binding domain and the capture domain of the capture probe affixed to the substrate; (d) contacting the biological sample with a third oligonucleotide; (e) hybridizing the third oligonucleotide to the first bridge sequence of the first oligonucleotide and the second bridge sequence of the second oligonucleotide; (f) ligating the first oligonucleotide and the second oligonucleotide, thereby creating a ligated probe; (g) releasing through activation using photolysis the caged moiety from the caged nucleotides from the capture binding domain, thereby allowing the capture binding domain of the second oligonucleotide to bind to the capture domain of the capture probe on the substrate; thereby enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample.

In another feature of the disclosure, the method for enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample includes: (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probe includes a spatial barcode and a capture domain; (b) binding a first analyte-binding moiety to a first target analyte in the biological sample, wherein the first analyte-binding moiety is associated with a first oligonucleotide including: (i) a first barcode that identifies the first analyte-binding moiety; and (ii) a first bridge sequence; (c) binding a second analyte-binding moiety to a second target analyte in the biological sample, wherein the second analyte-binding moiety is bound to a second oligonucleotide including: (i) a capture binding domain that binds to a capture domain on a capture probe, (ii) a second barcode that identifies second analyte-binding moiety; and (iii) a second bridge sequence; wherein the capture binding domain includes a first sequence that hybridizes to the capture domain of the capture probe and a second sequence (e.g., a blocking probe) that hybridizes to the first sequence, wherein the se blocking probe blocks the first sequence from hybridizing to the capture domain of the capture probe affixed to the substrate; (d) contacting the biological sample with a third oligonucleotide; (e) hybridizing the third oligonucleotide to the first bridge sequence of the first oligonucleotide and the second bridge sequence of the second oligonucleotide; (f) ligating the first oligonucleotide and the second oligonucleotide, thereby creating a ligated probe; (g) releasing the blocking probe of the capture binding domain from the first sequence of the capture binding domain and allowing the capture binding domain of the second oligonucleotide to bind to the capture domain of the capture probe on the substrate; thereby enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample.

Also provided herein are methods for determining (i) all or a part of the sequence of the ligation product specifically bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the first target analyte and/or second target analyte in the biological sample.

(Iii) Components of Proximity Ligation (A) First Analyte-binding Domain and Second Analyte-binding Domain

In some embodiments, the first analyte-binding moiety is a first protein. In some embodiments, the first analyte-binding moiety is a first antibody. For example, the first antibody can include, without limitation, a monoclonal antibody, recombinant antibody, synthetic antibody, a single domain antibody, a single-chain variable fragment (scFv), and or an antigen-binding fragment (Fab). In some embodiments, the first analyte-binding moiety binds to a cell surface analyte (e.g., any of the exemplary cell surface analytes described herein). In some embodiments, binding of the analyte is performed metabolically. In some embodiments, binding of the analyte is performed enzymatically. In some embodiments, a first analyte-binding moiety is an analyte capture agent (e.g., any of the exemplary analyte capture agents described herein).

In some embodiments, the first analyte-binding moiety is a non-protein. For example, the first analyte-binding moiety is an aptamer. In some embodiments, the first analyte-binding moiety is a DNA aptamer. In some embodiments, the DNA aptamer is a single-stranded DNA molecule. In some embodiments, the first analyte-binding moiety is an RNA aptamer. In some embodiments, the aptamer is synthetic. In some embodiments, the RNA aptamer is a single-stranded RNA molecule. In some embodiments, the aptamer binds to a specific target, including but not limited to, proteins, peptides, carbohydrates, small molecules, and toxins. In some embodiments, an aptamer as disclosed herein binds into its target specifically by folding into tertiary structures.

In some embodiments, the first analyte-binding moiety binds to an analyte present inside a cell (e.g., any of the exemplary analytes present inside a cell). For example, analytes can be derived from cytosol, from cell nuclei, from mitochondria, from microsomes, and more generally, from any other compartment, organelle, or portion of a cell. In some embodiments, the first analyte-binding moiety binds to an analyte present on the surface of or outside a cell. Analytes present on the surface of or outside a cell can include without limitation, a receptor, an antigen, a surface protein, a transmembrane protein, a cluster of differentiation protein, a protein channel, a protein pump, a carrier protein, a phospholipid, a glycoprotein, a glycolipid, a cell-cell interaction protein complex, an antigen-presenting complex, a major histocompatibility complex, an engineered T-cell receptor, a T-cell receptor, a B-cell receptor, a chimeric antigen receptor, an extracellular matrix protein, a posttranslational modification (e.g., phosphorylation, glycosylation, ubiquitination, nitrosylation, methylation, acetylation or lipidation) state of a cell surface protein, a gap junction, and an adherens junction. In some embodiments, the first analyte-binding moiety is capable of binding to cell surface analytes that are post-translationally modified.

In some embodiments, the first analyte-binding moiety binds cell surface analytes that are post-translationally modified. In such embodiments, the first analyte-binding moiety can be specific for one or more cell surface analytes based on a given state of post-translational modification (e.g., phosphorylation, glycosylation, ubiquitination, nitrosylation, methylation, acetylation or lipidation), such that a cell surface analyte profile can include post-translational modification information of one or more analytes.

In some embodiments, the second analyte-binding moiety is a second protein. In some embodiments, the second analyte-binding moiety is a second antibody. For example, the second antibody can include, without limitation, a monoclonal antibody, recombinant antibody, synthetic antibody, a single domain antibody, a single-chain variable fragment (scFv), and or an antigen-binding fragment (Fab). In some embodiments, the second analyte-binding moiety binds to a cell surface analyte (e.g., any of the exemplary cell surface analytes described herein). In some embodiments, the second analyte-binding moiety binds to an analyte present inside a cell (e.g., any of the exemplary analytes present inside a cell). In some embodiments, the second analyte-binding moiety binds a non-protein. For example, the second analyte-binding moiety can bind to a nucleic acid. In some embodiments, the second analyte-binding moiety binds cell surface analytes that are post-translationally modified. In such embodiments, the second analyte-binding moiety can be specific for one or more cell surface analytes based on a given state of post-translational modification (e.g., phosphorylation, glycosylation, ubiquitination, nitrosylation, methylation, acetylation or lipidation), such that a cell surface analyte profile can include post-translational modification information of one or more analytes. In some embodiments, a second analyte-binding moiety is an analyte capture agent (e.g., any of the exemplary analyte capture agents described herein).

In some embodiments, the second analyte-binding moiety is a non-protein. For example, the second analyte-binding moiety is an aptamer. In some embodiments, the second analyte-binding moiety is a DNA aptamer. In some embodiments, the DNA aptamer is a single-stranded DNA molecule. In some embodiments, the second analyte-binding moiety is an RNA aptamer. In some embodiments, the aptamer is synthetic. In some embodiments, the RNA aptamer is a single-stranded RNA molecule. In some embodiments, the aptamer binds to a specific target, including but not limited to, proteins, peptides, carbohydrates, small molecules, and toxins. In some embodiments, an aptamer as disclosed herein binds into its target specifically by folding into tertiary structures.

In some embodiments, the second analyte-binding moiety binds to an analyte present inside a cell (e.g., any of the exemplary analytes present inside a cell). For example, analytes can be derived from cytosol, from cell nuclei, from mitochondria, from microsomes, and more generally, from any other compartment, organelle, or portion of a cell. In some embodiments, the second analyte-binding moiety binds to an analyte present on the surface of or outside a cell. Analytes present on the surface of or outside a cell can include without limitation, a receptor, an antigen, a surface protein, a transmembrane protein, a cluster of differentiation protein, a protein channel, a protein pump, a carrier protein, a phospholipid, a glycoprotein, a glycolipid, a cell-cell interaction protein complex, an antigen-presenting complex, a major histocompatibility complex, an engineered T-cell receptor, a T-cell receptor, a B-cell receptor, a chimeric antigen receptor, an extracellular matrix protein, a posttranslational modification (e.g., phosphorylation, glycosylation, ubiquitination, nitrosylation, methylation, acetylation or lipidation) state of a cell surface protein, a gap junction, and an adherens junction. In some embodiments, the second analyte-binding moiety is capable of binding to cell surface analytes that are post-translationally modified.

In some embodiments, the first analyte-binding moiety and the second analyte-binding moiety each bind to the same analyte. For example, the first analyte-binding moiety binds to a first polypeptide and the second analyte-binding moiety also binds to the first polypeptide. In some embodiments, the first analyte-binding moiety and/or second analyte-binding moiety each bind to a different analyte. For example, in some embodiments, the first analyte-binding moiety binds to a first polypeptide and the second analyte-binding moiety binds to a second polypeptide. In some embodiments, the first analyte-binding moiety binds to a first polypeptide and the second analyte-binding moiety binds to a post-translational modification of the first polypeptide. In some embodiments, the first analyte-binding moiety binds to a first polypeptide and the second analyte-binding moiety binds to a post-translational modification of a second polypeptide. In some embodiments, the first analyte-binding moiety binds to a first post-translational modification and the second analyte-binding moiety binds to a second polypeptide. In some embodiments, the first analyte-binding moiety binds to a first post-translational modification and the second analyte-binding moiety binds to a second post-translational modification of a second polypeptide.

In some embodiments, the first and/or second analyte includes, but is not limited to, lipids, carbohydrates, peptides, proteins, glycoproteins (N-linked or O-linked), lipoproteins, phosphoproteins, specific phosphorylated or acetylated variants of proteins, amidation variants of proteins, hydroxylation variants of proteins, methylation variants of proteins, ubiquitylation variants of proteins, sulfation variants of proteins, viral coat proteins, extracellular and intracellular proteins, antibodies, and antigen binding fragments.

In some embodiments, the first analyte and the second analyte interact with each other in the biological sample. In some embodiments, the first analyte and the second analyte interact with each other via a protein-protein interaction. In a non-limiting example, the first analyte can be a receptor polypeptide and a second analyte can be a ligand polypeptide.

(B) First Oligonucleotide

In some embodiments, a first oligonucleotide is bound (e.g., conjugated or otherwise attached) to a first analyte-binding moiety. For example, the first oligonucleotide can be covalently linked to the first analyte-binding moiety. In some embodiments, the first oligonucleotide is bound to the first analyte-binding moiety via its 5′ end. In some embodiments, the first oligonucleotide includes a free 3′ end. In some embodiments the first oligonucleotide is bound to the first analyte-binding moiety via its 3′ end. In some embodiments, the first oligonucleotide includes a phosphate at its free 5′ end.

In some embodiments, the oligonucleotide is bound to the first analyte-binding moiety via a first linker. For example, the first oligonucleotide is bound to the first analyte-binding moiety via a first linker. In some embodiments, the first linker is a first cleavable linker (e.g., any of the exemplary cleavable linkers described herein). In some embodiment, the first linker is a linker with photo-sensitive chemical bonds (e.g., photo-cleavable linkers). In some embodiments, the first linker is a cleavable linker that can undergo induced dissociation.

In some embodiments, a first oligonucleotide is bound to a first analyte-binding domain via a 5′ end.

In some embodiments, a first oligonucleotide includes from 5′ to 3′: a functional sequence (e.g., any of the exemplary functional sequences described herein), a first barcode (e.g., a first analyte-binding-moiety barcode), and a first bridge sequence. In some embodiments, the functional sequence of the first oligonucleotide includes a primer sequence. The primer sequence can be used to bind a primer that can be used to amplify (i) at least part of the first oligonucleotide and (ii) at least part of any additional oligonucleotides (e.g., the first oligonucleotide) ligated to the first oligonucleotide. In some embodiments, a first oligonucleotide includes from 5′ to 3′: a functional sequence (e.g., any of the exemplary functional sequences described herein) and a first bridge sequence. In some embodiments, a first oligonucleotide includes from 5′ to 3′: a first barcode sequence (e.g., a first analyte-binding-moiety barcode), and a first bridge sequence. In some embodiments, a first oligonucleotide includes from 5′ to 3′: a functional sequence (e.g., any of the exemplary functional sequences described herein) and a first barcode sequence (e.g., a first analyte-binding-moiety barcode). In some embodiments, a first oligonucleotide includes from 5′ to 3′: a functional sequence (e.g., any of the exemplary functional sequences described herein), a first barcode sequence (e.g., a first analyte-binding-moiety barcode), and a first bridge sequence. In some embodiments, a second oligonucleotide includes from 5′ to 3′: a second barcode (e.g., a second analyte-binding-moiety barcode) and a functional sequence (e.g., any of the exemplary functional sequences described herein). In some embodiments, a second oligonucleotide includes from 5′ to 3′: a second barcode (e.g., a second analyte-binding-moiety barcode) and a capture binding domain. In some embodiments, a second oligonucleotide includes from 5′ to 3′: a second barcode (e.g., a second analyte-binding-moiety barcode) and a capture binding domain, where the capture binding domain includes a first sequence that binds to a capture domain of a capture probe and a second sequence (e.g., a blocking probe) that hybridizes to the first sequence to block the first sequence from hybridizing to the capture domain of the capture probe.

In some embodiments, a first oligonucleotide is bound to first analyte-binding domain via a 3′ end. In some embodiments, a first oligonucleotide includes from 3′ to 5′: a capture binding domain sequence, a first barcode (e.g., a first analyte-binding-moiety barcode), and a first bridge sequence. In some embodiments, a first oligonucleotide includes from 3′ to 5′: a capture binding domain sequence and a first bridge sequence. In some embodiments, a first oligonucleotide includes from 3′ to 5′: a first barcode sequence (e.g., a first analyte-binding-moiety barcode) and a first bridge sequence. In some embodiments, a first oligonucleotide includes from 3′ to 5′: a capture binding domain sequence and a first barcode sequence (e.g., a first analyte-binding-moiety barcode). In some embodiments of any of the first oligonucleotides that include a capture binding domain, the capture binding domain includes a first sequence that binds to a capture domain of a capture probe and a second sequence (e.g., a blocking probe) that hybridizes to the first sequence to block the first sequence from hybridizing to the capture domain of the capture probe.

In some embodiments, a first barcode (e.g., a first analyte-binding-moiety barcode) is used to identify the first analyte-binding moiety to which it is bound. The first barcode can be any of the exemplary barcodes described herein. In some embodiments, the first oligonucleotide is a first capture agent barcode domain. In some embodiments, the first barcode sequence includes a first analyte-binding-moiety barcode (e.g., any of the exemplary analyte-binding-moiety barcodes described herein). In some embodiments, the first barcode sequence includes a first analyte-binding-moiety barcode and a capture binding domain (e.g., a capture binding domain that includes a first sequence that binds to a capture domain of a capture probe and a s blocking probe that hybridizes to the first sequence to block the first sequence from hybridizing to the capture domain of the capture probe).

(C) Second Oligonucleotide

In some embodiments, a second oligonucleotide is bound (e.g., conjugated or otherwise attached) to a second analyte-binding moiety. For example, the second oligonucleotide can be covalently linked to the second analyte-binding moiety. In some embodiments, the second oligonucleotide is bound to the second analyte-binding moiety via its 5′ end. In some embodiments, the second oligonucleotide includes a free 3′ end. In some embodiments the second oligonucleotide is bound to the second analyte-binding moiety via its 3′ end. In some embodiments, the second oligonucleotide includes a free 5′ end.

In some embodiments, the oligonucleotide is bound to the second analyte-binding moiety via a second linker (e.g., any of the exemplary linkers described herein). For example, the second oligonucleotide is bound to the second analyte-binding moiety via a second linker. In some embodiments, the second linker is a second cleavable linker (e.g., any of the exemplary cleavable linkers described herein). In some embodiment, the second linker is a linker with photo-sensitive chemical bonds (e.g., photo-cleavable linkers). In some embodiments, the second linker is a cleavable linker that can undergo induced dissociation.

In some embodiments, a second oligonucleotide is bound (e.g., attached via any of the methods described herein) to a second analyte-binding domain via a 3′ end. In some embodiments, a second oligonucleotide includes from 3′ to 5′: a capture binding domain, a second barcode (e.g., a second analyte-binding-moiety barcode), and a second bridge sequence. In some embodiments, a second oligonucleotide includes from 3′ to 5′: a capture binding domain and a second bridge sequence. In some embodiments, a second oligonucleotide includes from 3′ to 5′: a second barcode and a second bridge sequence. In some embodiments of any of the second oligonucleotides bound to a second analyte-binding domain where the second oligonucleotide includes a capture binding domain, the capture binding domain includes a first sequence that binds to a capture domain of a capture probe and a second sequence (e.g, a blocking probe) that hybridizes to the first sequence to block the first sequence from hybridizing to the capture domain of the capture probe. In some embodiments of any of the second oligonucleotides bound to a second analyte-binding domain where the second oligonucleotide includes a capture binding domain, the capture binding domains includes caged nucleotides (e.g., any of the caged nucleotides described herein).

In some embodiments, a second oligonucleotide is bound (e.g., attached via any of the methods described herein) to a second analyte-binding domain via a 5′ end. In some embodiments, a second oligonucleotide includes from 5′ to 3′: a second barcode (e.g., a second analyte-binding-moiety barcode) and a capture binding domain. In some embodiments, a second oligonucleotide includes from 5′ to 3′: a functional sequence and a capture binding domain. In some embodiments, a second oligonucleotide includes from 5′ to 3′: a second barcode (e.g., a second analyte-binding-moiety barcode) and a capture binding domain that includes a plurality of caged nucleotides. In some embodiments, a second oligonucleotide includes from 5′ to 3′: a functional sequence and a capture binding domain that includes a plurality of caged nucleotides. In some embodiments, the functional sequence is a primer sequence. The primer sequence can be used to bind a primer that can be used to amplify (i) at least part of the second oligonucleotide. In some embodiments of any of the second oligonucleotides bound to a second analyte-binding domain where the second oligonucleotide includes a capture binding domain, the capture binding domain includes a first sequence that binds to a capture domain of a capture probe and a second sequence (e.g, a blocking probe) that hybridizes to the first sequence to block the first sequence from hybridizing to the capture domain of the capture probe. In some embodiments of any of the second oligonucleotides bound to a second analyte-binding domain where the second oligonucleotide includes a capture binding domain, the capture binding domains includes caged nucleotides (e.g., any of the caged nucleotides described herein).

In some embodiments, a second barcode (e.g., a second analyte-binding-moiety barcode) is used to identify the second analyte-binding moiety to which it is bound. The second barcode can be any of the exemplary barcodes described herein.

In some embodiments, the second oligonucleotide is a second capture agent barcode domain. In some embodiments, the second capture agent barcode domain includes a second analyte-binding-moiety barcode (e.g., any of the exemplary analyte-binding-moiety barcodes described herein). In some embodiments, the second capture agent barcode domain includes a second analyte-binding-moiety barcode and a second capture binding domain.

In some embodiments, a first oligonucleotide is bound to a first analyte-binding domain via a 5′ end and a second oligonucleotide is bound (e.g., attached via any of the method described herein) to a second analyte-binding domain via a 3′ end. In some embodiments, a first oligonucleotide includes from 5′ to 3′: a functional sequence (e.g., any of the exemplary functional sequences described herein), a first barcode (e.g., a first analyte-binding-moiety barcode), and a first bridge sequence, and a second oligonucleotide includes from 3′ to 5′: a capture binding domain (e.g., including a first sequence that binds to a capture domain of a capture probe and a second sequence (e.g., a blocking probe) that binds to the first sequence, a second barcode (e.g., a second analyte-binding-moiety barcode), and a second bridge sequence. In some embodiments, a first oligonucleotide includes from 5′ to 3′: a functional sequence (e.g., any of the exemplary functional sequences described herein), a first barcode (e.g., a first analyte-binding-moiety barcode), and a first bridge sequence, and a second oligonucleotide includes from 3′ to 5′: a capture binding domain that includes a plurality of caged nucleotides, a second barcode (e.g., a second analyte-binding-moiety barcode), and a second bridge sequence.

In some embodiments of any of the methods of determining a location of at least one analyte in a biological sample, a first oligonucleotide is bound (e.g., attached via any of the methods described herein) to a first analyte-binding domain via a 5′ end and a second oligonucleotide is bound (e.g., attached via any of the method described herein) to a second analyte-binding domain via a 5′ end. In some embodiments, a first oligonucleotide includes from 5′ to 3′: a functional sequence (e.g., any of the exemplary functional sequences described herein) and a first barcode (e.g., a first analyte-binding-moiety barcode), and a second oligonucleotide includes from 5′ to 3′: a second barcode (e.g., a second analyte-binding-moiety barcode) and a capture binding domain. In some embodiments, a first oligonucleotide includes from 5′ to 3′: a functional sequence (e.g., any of the exemplary functional sequences described herein) and a first barcode (e.g., a first analyte-binding-moiety barcode), and a second oligonucleotide includes from 5′ to 3′: a second barcode (e.g., a second analyte-binding-moiety barcode), and a capture binding domain that includes a plurality of caged nucleotides. In some embodiments of any of the second oligonucleotides bound to a second analyte-binding domain where the second oligonucleotide includes a capture binding domain, the capture binding domain includes a first sequence that binds to a capture domain of a capture probe and a second sequence (e.g., a blocking probe) that hybridizes to the first sequence to block the first sequence from hybridizing to the capture domain of the capture probe. In some embodiments of any of the second oligonucleotides bound to a second analyte-binding domain where the second oligonucleotide includes a capture binding domain, the capture binding domain includes caged nucleotides (e.g., any of the caged nucleotides described herein).

(D) Third Oligonucleotide

In some embodiments of any of the methods of determining a location of at least one analyte in a biological sample, a third oligonucleotide is hybridized to the first oligonucleotide and the second oligonucleotide via the first bridge sequence and the second bridge sequence, respectively. In some embodiments, a first portion of the third oligonucleotide is at least partially complementary to the first bridge sequence. In some embodiments, a second portion of the third oligonucleotide is at least partially complementary to the second bridge sequence. In some embodiments, the third oligonucleotide hybridizes to the first bridge sequence prior to hybridizing to the second bridge sequence. In some embodiments, the third oligonucleotide hybridizes to the second bridge sequence prior to hybridizing to the first bridge sequence. In some embodiments, the third oligonucleotide hybridizes simultaneously to the first bridge sequence and the second bridge sequence.

In some embodiments, a third oligonucleotide is hybridized to the first oligonucleotide and the second oligonucleotide via the first barcode and the second barcode, respectively. In some embodiments, a first sequence of the third oligonucleotide is at least partially complementary to the first barcode. In some embodiments, a second sequence of the third oligonucleotide is at least partially complementary to the second barcode. In some embodiments, the third oligonucleotide hybridizes to the first barcode prior to hybridizing to the second barcode. In some embodiments, the third oligonucleotide hybridizes to the second barcode prior to hybridizing to the first barcode. In some embodiments, the third oligonucleotide hybridizes simultaneously to the first barcode and the second barcode. In some embodiments, a third oligonucleotide includes from 5′ to 3′ a second portion that binds to a sequence of the second oligonucleotide (e.g., a second barcode) and a first portion that binds to a sequence of the first oligonucleotide (e.g., a first barcode). In some embodiments, a third oligonucleotide includes from 5′ to 3′ a first sequence that binds to a sequence of the first oligonucleotide (e.g., a first barcode) and a second sequence that binds to a sequence of the second oligonucleotide (e.g., a second barcode).

In some embodiments, the third oligonucleotide includes a sequence of about 10 nucleotides to about 100 nucleotides (e.g., a sequence of about 10 nucleotides to about 90 nucleotides, about 10 nucleotides to about 80 nucleotides, about 10 nucleotides to about 70 nucleotides, about 10 nucleotides to about 60 nucleotides, about 10 nucleotides to about 50 nucleotides, about 10 nucleotides to about 40 nucleotides, about 10 nucleotides to about 30 nucleotides, about 10 nucleotides to about 20 nucleotides, about 20 nucleotides to about 90 nucleotides, about 20 nucleotides to about 80 nucleotides, about 20 nucleotides to about 70 nucleotides, about 20 nucleotides to about 60 nucleotides, about 20 nucleotides to about 50 nucleotides, about 20 nucleotides to about 40 nucleotides, about 20 nucleotides to about 30 nucleotides, about 30 nucleotides to about 90 nucleotides, about 30 nucleotides to about 80 nucleotides, about 30 nucleotides to about 70 nucleotides, about 30 nucleotides to about 60 nucleotides, about 30 nucleotides to about 50 nucleotides, about 30 nucleotides to about 40 nucleotides, about 40 nucleotides to about 90 nucleotides, about 40 nucleotides to about 80 nucleotides, about 40 nucleotides to about 70 nucleotides, about 40 nucleotides to about 60 nucleotides, about 40 nucleotides to about 50 nucleotides, about 50 nucleotides to about 90 nucleotides, about 50 nucleotides to about 80 nucleotides, about 50 nucleotides to about 70 nucleotides, about 50 nucleotides to about 60 nucleotides, about 60 nucleotides to about 90 nucleotides, about 60 nucleotides to about 80 nucleotides, about 60 nucleotides to about 70 nucleotides, about 70 nucleotides to about 90 nucleotides, about 70 nucleotides to about 80 nucleotides, or about 80 nucleotides to about 90 nucleotides).

In some embodiments, the third oligonucleotide hybridized to both the first bridge sequence and second bridge sequence enables ligation of the first oligonucleotide and the second oligonucleotide. In some embodiments, a ligase is used. In some aspects, the ligase includes a DNA ligase. In some aspects, the ligase includes RNA ligase. In some aspects, the ligase includes one or more of a T4 RNA ligase (Rnl2), a SplintR ligase, a single stranded DNA ligase, or a T4 DNA ligase. In some aspects, the ligase is a T4 RNA ligase 2 (Rnl2) ligase.

In some embodiments, the ligation of the first oligonucleotide and the second oligonucleotide includes enzymatic or chemical ligation. In some embodiments, the ligation of the first oligonucleotide and the second oligonucleotide includes enzymatic ligation. In some embodiments, the ligation reaction utilizes ligase (e.g., any of the exemplary ligases described herein). For example, the ligase is a splintR ligase.

In some aspects, the methods include “sticky-end” or “blunt-end” ligations. Additionally, single-stranded ligation can be used to perform proximity ligation on a single-stranded nucleic acid molecule.

In some embodiments, the ligase creates a ligation product (e.g., a full length oligonucleotide) that includes both the first oligonucleotide and the second oligonucleotide. In some embodiments, a third oligonucleotide hybridizes to a first oligonucleotide and a second oligonucleotide when the first analyte-binding domain and the second analyte-binding domain are about 400 nm distance (e.g., about 300 nm, about 200 nm, about 150 nm, about 100 nm, about 50 nm, about 25 nm, about 10 nm, or about 5 nm) from each other.

In some embodiments, when the third oligonucleotide is hybridized to the first bridge sequence and the second bridge sequence, the first oligonucleotide and the second oligonucleotide are directly adjacent in a manner that enables ligation in the presence of a ligase enzyme. In some embodiments, when the first oligonucleotide and the second nucleotide are directly adjacent to each other, a ligation reaction occurs between a free 3′ of the first oligonucleotide and a 5′ of the second oligonucleotide that includes a phosphorylated end. In some embodiments, when the first oligonucleotide and the second oligonucleotide are directly adjacent to each other, a ligation reaction occurs directly between a free 3′ end of the second oligonucleotide and a phosphorylated 5′ end of the first oligonucleotide. In some embodiments, when the third oligonucleotide is hybridized to the first bridge sequence and the second bridge sequence, the first oligonucleotide and the second oligonucleotide are not directly adjacent. For example, when the third oligonucleotide is hybridized to the first bridge sequence and the second bridge sequence, the first oligonucleotide and the second oligonucleotide requires additional nucleotides to hybridize to the third oligonucleotide in between the first and second oligonucleotides in order for a ligation product to be generated. In some embodiments, additional nucleotide sequences can be added between the first oligonucleotide and the second oligonucleotide by DNA polymerases (e.g., any of the exemplary DNA polymerases described herein).

In some embodiments, the third oligonucleotide is designed so that a portion of the nucleic acid sequence of the third oligonucleotide does not hybridize to either the first bridge sequence or the second bridge sequence. This portion of the third oligonucleotide can be located between the first sequence that is complementary to the first bridge sequence and the second sequence that is complementary to the second bridge sequence. In some embodiments, the third oligonucleotide is designed so that a portion of the nucleic acid sequence of the third oligonucleotide does not hybridize to either the first barcode or the second barcode. This portion of the third oligonucleotide can be located between the first sequence that is complementary to the first barcode and the second sequence that is complementary to the second barcode. In some embodiments, the portion of the of the nucleic acid sequence of the third oligonucleotide that does not hybridize to the first or second bridge sequence forms a secondary structure (e.g., stem-loop, hairpins, or other structure typically formed by single stranded DNA).

(E) Bridge Sequence (in the First and Second Oligonucleotides)

In some embodiments, the first oligonucleotide and/or the second oligonucleotides include a bridge sequence. In some embodiments, the first oligonucleotide includes a bridge sequence. In some embodiments, the second oligonucleotide includes a bridge sequence. As used herein, a “bridge sequence” refers to a sequence that is at least partially complementary to a third oligonucleotide and functions to bring the a first sequence and a second sequence in sufficient proximity that the first and second sequences can be ligated together. In some aspects, the bridge sequence of the first oligonucleotide has at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to a sequence that is 100% complimentary to the third oligonucleotide sequence. In some aspects, the bridge sequence of the second oligonucleotide has at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to a sequence that is 100% complimentary to the third oligonucleotide sequence. In some embodiments, a bridge sequence (e.g., a first bridge sequence and/or a second bridge sequence) is about 5 nucleotides to about 50 nucleotides (e.g., about 5 nucleotides to about 45 nucleotides, about 5 nucleotides to about 40 nucleotides, about 5 nucleotides to about 35 nucleotides, about 5 nucleotides to about 30 nucleotides, about 5 nucleotides to about 25 nucleotides, about 5 nucleotides to about 20 nucleotides, about 5 nucleotides to about 15 nucleotides, about 5 nucleotides to about 10 nucleotides, about 10 nucleotides to about 45 nucleotides, about 10 nucleotides to about 40 nucleotides, about 10 nucleotides to about 35 nucleotides, about 10 nucleotides to about 30 nucleotides, about 10 nucleotides to about 25 nucleotides, about 10 nucleotides to about 20 nucleotides, about 10 nucleotides to about 15 nucleotides, about 15 nucleotides to about 45 nucleotides, about 15 nucleotides to about 40 nucleotides, about 15 nucleotides to about 35 nucleotides, about 15 nucleotides to about 30 nucleotides, about 15 nucleotides to about 25 nucleotides, about 15 nucleotides to about 20 nucleotides, about 20 nucleotides to about 45 nucleotides, about 20 nucleotides to about 40 nucleotides, about 20 nucleotides to about 35 nucleotides, about 20 nucleotides to about 30 nucleotides, about 20 nucleotides to about 25 nucleotides, about 25 nucleotides to about 45 nucleotides, about 25 nucleotides to about 40 nucleotides, about 25 nucleotides to about 35 nucleotides, about 25 nucleotides to about 30 nucleotides, about 30 nucleotides to about 45 nucleotides, about 30 nucleotides to about 40 nucleotides, about 30 nucleotides to about 35 nucleotides, about 35 nucleotides to about 45 nucleotides, about 35 nucleotides to about 40 nucleotides, or about 40 nucleotides to about 45 nucleotides). In some embodiments, a bridge sequence is a functional sequence (e.g., any of the exemplary functional sequence described herein).

(D) Methods of Enhancing Detection of Nucleic Acid Analytes (I) Spatially Detecting Nucleic Acid Analytes Using Blocking Probes

Disclosed herein are methods of detecting the location and/or the abundance of a nucleic acid analyte. Also disclosed are methods of blocking capture binding domains on an analyte before it is detected by a capture probe on a substrate. For example, in some instances, the methods include placing a biological sample (i.e., any biological sample disclosed herein) on a substrate that has a plurality of capture probes. In some instances, blocking probes are added to the sample and blocking probes interact with a capture binding domain in a nucleic acid analyte (e.g., a poly(A) sequence of an mRNA), thereby blocking the nucleic acid analytes from binding prematurely or binding to other molecules in an untimely and unwanted manner in a biological sample. Thus, the methods disclosed herein allow for nucleic acid analytes, such as mRNA, to be captured with increased efficiency and/or less background. In some instances, blocking is reversible using methods disclosed herein. In some instances, the blocking probe is released from the analyte prior to contacting the analyte with the capture probes.

(Ii) RNA Templated Ligation Using Blocking Probes

RNA templated ligation (RTL) is a process that includes multiple oligonucleotides (e.g., oligonucleotide probes) that hybridize to adjacent complementary nucleic acid analyte sequences. Upon hybridization, the two oligonucleotides are ligated to one another, creating a ligation product that can be used as a proxy for the target nucleic acid analyte, but only in the event that both oligonucleotides hybridize to their respective complementary sequences. In some instances, at least one of the oligonucleotides includes a capture binding domain and a blocking domain. The blocking domain prevents the premature interaction of the capture binding domain with a capture domain on a capture probe on an array described herein. In some instances, the blocking domain is released prior to contacting the ligation product with the array that includes the capture probes. In some instances, also prior to hybridization of the capture binding domain to the capture domain of a capture probe, an endonuclease digests the blocking domain that is hybridized to the ligation product. This step frees the newly formed ligation product to hybridize to the capture probe once the blocking domain is removed from the capture binding domain. In this way, RTL that includes a blocking domain and/or caged nucleotides provides a method to perform targeted RNA capture with increased efficiency.

Targeted RNA capture allows for the examination of a subset of RNA analytes from the entire transcriptome, in its broadest application. One limitation of targeted RNA capture is the ability to accurately detect genetic variants within a subset of RNA analytes. Current limitations are driven by probe design requirements (e.g., a first probe oligonucleotide and/or a second probe oligonucleotide need to incorporate the genetic variation into the probe sequence in order to detect the variation on the complement strand (e.g., the RNA analyte)). Provided herein are methods for identifying a location of an analyte in a biological sample based on (i) probe oligonucleotides that include a blocking domain and/or caged nucleotides that prevent premature interaction of the capture binding domain with the capture domain of the capture probe, (ii) blocking the interaction of the capture binding domain with a capture probe, and (iii) releasing the blocking domain and/or caged nucleotides thereby allowing the capture binding domain to interact with the capture domain on the capture probe at the appropriate time to enhance the capture of the ligation product by the capture probe and/or decrease background binding. The methods provided herein prevent premature interaction between capture binding domains and capture domain, thereby increasing the resolution of the RNA template ligation technology.

In some embodiments, the subset of analytes includes an individual target RNA. In some instances, the presence of the ligation product that is created as a result of the RTL methods described herein indicates that the individual target RNA is present. In some instances, the absence of the ligation product that is created as a result of the RTL methods described herein indicates that the individual target RNA is present. In some instances, an absence of the ligation product is because one of the oligonucleotide probes did not hybridize to the analyte. In some instances, an absence of the ligation product is because at least one or both of the oligonucleotide probes did not hybridize to the analyte.

In some embodiments, the subset of analytes detected using methods disclosed herein includes two or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) targeted RNAs. In some embodiments, the subset of analytes includes one or more mRNAs transcribed by a single gene. In some embodiments, the subset of analytes includes one or more mRNAs transcribed by more than one targeted gene. In some embodiments, the subset of analytes includes one or more mRNA splice variants of one or more targeted genes. In some embodiments, the subset of analytes includes non-polyadenylated RNAs in a biological sample. In some embodiments, the subset of analytes includes detection of mRNAs having one or more single nucleotide polymorphisms (SNPs) in a biological sample.

In some embodiments, the subset of analytes includes mRNAs that mediate expression of a set of genes of interest. For example, in some instances, the subset of analytes detected using the RTL methods disclosed herein include analytes that are translated into transcription factors that control one or more cellular pathways. In some embodiments, the subset of analytes includes mRNAs that share identical or substantially similar sequences, which mRNAs are translated into polypeptides having similar functional groups or protein domains. In some embodiments, the subset of analytes includes mRNAs that do not share identical or substantially similar sequences, which mRNAs are translated into proteins that do not share similar functional groups or protein domains. In some embodiments, the subset of analytes includes mRNAs that are translated into proteins that function in the same or similar biological pathways. In some embodiments, the biological pathways are associated with a pathologic disease. For example, targeted RNA capture can detect genes that are overexpressed or underexpressed in a cancer sample. In some embodiments, the targeted RNA capture can detect diseases, cancers, or monitor the efficacy of disease or cancer treatments or therapies based on gene expression profiles. In some embodiments, targeted RNA capture can be used to identify rare and undiagnosed genetic diseases by gene expression patterns of, for example, rare mRNAs.

In some embodiments, the subset of analytes includes 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, about 110, about 120, about 130, about 140, about 150, about 160, about 170, about 180, about 190, about 200, about 225, about 250, about 275, about 300, about 325, about 350, about 375, about 400, about 425, about 450, about 475, about 500, about 600, about 700, about 800, about 900, about 1000, or more analytes.

In some embodiments, the subset of analytes detected by targeted RNA capture methods provided herein includes a large proportion of the transcriptome of one or more cells. For example, the subset of analytes detected by targeted RNA capture methods provided herein can include at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, or more of the mRNAs present in the transcriptome of one or more cells.

Methods disclosed herein can be performed on any type of sample. In some embodiments, the sample is a fresh tissue. In some embodiments, the sample is a frozen sample. In some embodiments, the sample was previously frozen. In some embodiments, the sample is a formalin-fixed, paraffin embedded (FFPE) sample. FFPE samples generally are heavily cross-linked and fragmented, and therefore this type of sample allows for limited RNA recovery using conventional detection techniques. In certain embodiments, methods of targeted RNA capture provided herein are less affected by RNA degradation associated with FFPE fixation than other methods (e.g., methods that take advantage of oligo-dT capture and reverse transcription of mRNA). In certain embodiments, methods provided herein enable sensitive measurement of specific genes of interest that otherwise might be missed with a whole transcriptomic approach.

In some embodiments, a biological sample (e.g. tissue section) can be fixed with methanol, stained with hematoxylin and eosin, and imaged. In some embodiments, fixing, staining, and imaging occurs before one or more oligonucleotide probes are hybridized to the sample. Some embodiments of any of the workflows described herein can further include a destaining step (e.g., a hematoxylin and eosin destaining step), after imaging of the sample and prior to permeabilizing the sample. For example, destaining can be performed by performing one or more (e.g., one, two, three, four, or five) washing steps (e.g., one or more (e.g., one, two, three, four, or five) washing steps performed using a buffer including HCl). The images can be used to map spatial gene expression patterns back to the biological sample. A permeabilization enzyme can be used to permeabilize the biological sample directly on the slide.

In another feature of the disclosure, provided herein is a method for enhancing the specificity of binding of an oligonucleotide to a target analyte in a biological sample, where the method includes: (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probe includes a spatial barcode and the capture domain; (b) binding a first probe oligonucleotide and a second probe oligonucleotide to a target analyte in the biological sample, wherein: the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to adjacent sequences of the analyte, the second probe oligonucleotide includes a capture binding domain that binds to a capture domain of a capture probe, wherein the capture binding domain is blocked and prevented from hybridizing to the capture domain of the capture probe affixed to the substrate; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating a ligated probe that is substantially complementary to the analyte; (d) releasing: the ligated probe from the analyte, and the block from the capture binding domain, thereby allowing the capture binding domain to bind to the capture domain of the capture probe on the substrate, thereby enhancing the specificity of binding of a polynucleotide to a target analyte in a biological sample.

In another feature of the disclosure, provided herein is a method for enhancing the specificity of binding of an oligonucleotide to a target analyte in a biological sample, where the method includes: (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probe includes a spatial barcode and the capture domain (b) binding a first probe oligonucleotide and a second probe oligonucleotide to a target analyte in the biological sample, wherein: the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to adjacent sequences of the analyte, the second probe oligonucleotide includes a capture binding domain that binds to a capture domain of a capture probe, wherein the capture binding domain includes a plurality of caged nucleotides, where a caged nucleotide of the plurality of caged nucleotides includes a caged moiety that blocks hybridization between the capture binding domain and the capture domain of the capture probe affixed to the substrate; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating a ligated probe that is substantially complementary to the analyte; (d) releasing: the ligated probe from the analyte, and the caged moiety from each of the caged nucleotides from the capture binding domain through activation using photolysis, thereby allowing the capture binding domain of the second oligonucleotide to specifically bind to the capture domain of the capture probe on the substrate; thereby enhancing the specificity of binding of a first analyte-binding moiety to a first target analyte and a second analyte binding moiety to a second target analyte in a biological sample.

In another feature of the disclosure, provided herein is a method for enhancing the specificity of binding of a oligonucleotide to a target analyte in a biological sample, where the method includes: (a) contacting the biological sample with a substrate, wherein the substrate includes a plurality of capture probes affixed to the substrate, wherein the capture probe includes a spatial barcode and the capture domain (b) binding a first probe oligonucleotide and a second probe oligonucleotide to a target analyte in the biological sample, wherein: the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to adjacent sequences of the analyte, the second probe oligonucleotide includes a capture binding domain that binds to a capture domain of a capture probe, wherein the capture binding domain includes a first sequence (i.e., a capture binding domain) that hybridizes to the capture domain of the capture probe and a second sequence (e.g., a blocking probe) that hybridizes to the first sequence, wherein the blocking probe blocks the first sequence from hybridizing to the capture domain of the capture probe affixed to the substrate; (c) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby creating a ligated probe that is substantially complementary to the analyte; (d) releasing: the ligated probe from the analyte, and the blocking probe of the capture binding domain from the first sequence of the capture binding domain and allowing the capture binding domain to specifically bind to the capture domain of the capture probe on the substrate, thereby enhancing the specificity.

Also provided herein are methods for determining (i) all or a part of the sequence of the ligated probe specifically bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of a target analyte in the biological sample.

In some embodiments, the RTL methods that allow for targeted RNA capture as provided herein include a first probe oligonucleotide and a second probe oligonucleotide. The first and second probe oligonucleotides each include sequences that are substantially complementary to the sequence of an analyte of interest. By substantially complementary, it is meant that the first and/or second probe oligonucleotide is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% complementary to a sequence in an analyte. In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to adjacent sequences on an analyte. Methods provided herein may be applied to hybridization of two or more probe oligonucleotides to a single nucleic acid molecule. In some embodiments, each target analyte includes a first target region and a second target region. In some instances, the methods include providing a plurality of first probe oligonucleotides and a plurality of second probe oligonucleotides. In some instances, a first probe oligonucleotide hybridizes to a first target region of the nucleic acid. In some instances, a second probe oligonucleotide hybridizes to a second target region of the nucleic acid.

In some embodiments, the first and/or second probe oligonucleotide as disclosed herein includes one of at least two ribonucleic acid bases at the 3′ end; a functional sequence; a phosphorylated nucleotide at the 5′ end; and/or a capture binding domain. In some embodiments, the first and/or second probe as disclosed herein includes one of at least two ribonucleic acid bases at the 3′ end; a functional sequence; a phosphorylated nucleotide at the 5′ end; and/or a capture binding domain that includes a plurality of caged nucleotides. In some embodiments, the first and/or second probe as disclosed herein includes one of at least two ribonucleic acid bases at the 3′ end; a functional sequence; a phosphorylated nucleotide at the 5′ end; and a capture binding domain. In some embodiments, the capture binding domain includes a first sequence that binds to a capture domain of a capture probe and a second sequence (e.g., a blocking probe) that hybridizes to the first sequence to prevent interaction of the first sequence with the capture domain of the capture probe. In some embodiments, the functional sequence is a primer sequence. The capture binding domain is a sequence that is complementary to a particular capture domain present in a capture probe. The first sequence of the capture binding domain includes a sequence that is substantially complementary to a portion of the capture domain. In some embodiments, the capture binding domain includes a poly(A) sequence. In some embodiments, the capture binding domain includes a poly-uridine sequence, a poly-thymidine sequence, or both. In some embodiments, the capture binding domain includes a random sequence (e.g., a random hexamer or octamer). In some embodiments, the capture binding domain is complementary to a capture domain in a capture probe that detects a particular target(s) of interest. In some embodiments, the capture binding domain includes a poly-uridine sequence and/or a poly-thymidine sequence, and the blocking domain includes a poly-adenosine sequence. In some embodiments, the capture binding domain includes a random sequence (e.g., a random hexamer or octamer). In some embodiments, the capture binding domain is complementary to a capture domain in a capture probe that detects a particular target(s) of interest.

In some embodiments, the first probe oligonucleotide hybridizes to an analyte. In some embodiments, the second probe oligonucleotide hybridizes to an analyte. In some embodiments, both the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte. Hybridization can occur at a target having a sequence that is 100% complementary to the probe oligonucleotide(s). In some embodiments, hybridization can occur at a target having a sequence that is at least (e.g. at least about) 80%, at least (e.g. at least about) 85%, at least (e.g. at least about) 90%, at least (e.g. at least about) 95%, at least (e.g. at least about) 96%, at least (e.g. at least about) 97%, at least (e.g. at least about) 98%, or at least (e.g. at least about) 99% complementary to the probe oligonucleotide(s).

After hybridization of the first and second probe oligonucleotides, in some embodiments, the first probe oligonucleotide is extended. After hybridization, in some embodiments, the second probe oligonucleotide is extended. In some instances, a polymerase (e.g., a DNA polymerase) extends the first and/or second oligonucleotide.

In some embodiments, methods disclosed herein include a wash step. In some instances, the wash step occurs after hybridizing the first and the second probe oligonucleotides. The wash step removes any unbound oligonucleotides and can be performed using any technique or solution disclosed herein or known in the art. In some embodiments, multiple wash steps are performed to remove unbound oligonucleotides.

In some instances, the first and second target regions of the first probe oligonucleotide and the second probe oligonucleotide are adjacent to one another. In some embodiments, the first and second probe oligonucleotides bind to complementary sequences on the same transcript. In some embodiments, the complementary sequences to which the first probe oligonucleotide and the second probe oligonucleotide bind are 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, about 125, or about 150 nucleotides away from each other. Gaps between the probe oligonucleotides may first be filled prior to ligation, using, for example, dNTPs in combination with a polymerase such as Mu polymerase, DNA polymerase, RNA polymerase, reverse transcriptase, Taq polymerase, and/or any combinations, derivatives, and variants (e.g., engineered mutants) thereof. In some embodiments, when the first and second probe oligonucleotides are separated from each other by one or more nucleotides, deoxyribonucleotides are used to extend and ligate the first and second probe oligonucleotides.

In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte on the same transcript. In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte on the same exon. In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte on different exons. In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte that is the result of a translocation event (e.g., in the setting of cancer). The methods provided herein make it possible to identify alternative splicing events, translocation events, and mutations that change the hybridization rate of one or both probe oligonucleotides (e.g., single nucleotide polymorphisms, insertions, deletions, point mutations).

In some embodiments, after hybridization of probe oligonucleotides (e.g., first and the second probe oligonucleotides) to the analyte, the probe oligonucleotides (e.g., the first probe oligonucleotide and the second probe oligonucleotide) are ligated together, creating a single ligated probe that is complementary to the analyte. Ligation can be performed enzymatically or chemically, as described herein.

In some instances, the first and second probe oligonucleotides are hybridized to the first and second target regions of the analyte, and the probe oligonucleotides are subjected to a nucleic acid reaction to ligate them together. In some instances, the ligation is enzymatic. In some instances, the ligation reaction catalyzed by a ligase disclosed herein occurs in the following steps. In some instances, the ligase enzyme is pre-activated by charging with ATP. Addition of ATP to ligase enzyme causes formation of an intermediate AMP-enzyme species concomitant with hydrolysis of ATP to yield AMP. In some instances, the pre-activating step does not occur. In some instances, the next step includes methods where the charged AMP-enzyme intermediate binds to the dsDNA (or dsRNA, or RNA/DNA complex) and transfers the AMP moiety to the free 5′ terminal phosphate, to form a high energy 5′-5′ phosphate bond. In some instances, the third step includes methods wherein the enzyme provides the appropriate environment in which the 3′ hydroxyl group of the second strand of DNA (or RNA) is able to attach the high energy 5′-5′ phosphate bond, thereby forming a covalent phosphodiester bond as a product and releasing ligase enzyme and AMP. Free enzyme does not bind the intermediate high energy 5′-5′ phosphate bond species to an appreciable amount. Thus, if the ligase prematurely releases from the duplex after formation of the high energy 5′-5′ phosphate bond, the reaction will typically end and the intermediate will not proceed to the final ligated product.

In some instances, the probes may be subjected to an enzymatic ligation reaction, using a ligase (e.g., T4 RNA ligase (Rnl2), a splintR ligase, a single stranded DNA ligase, or a T4 DNA ligase). See, e.g., Zhang L., et al.; Archaeal RNA ligase from thermoccocus kodakarensis for template dependent ligation RNA Biol. 2017; 14(1): 36-44, which is incorporated by reference in its entirety, for a description of KOD ligase. In some instances, the ligase is T4 RNA ligase (Rnl2). T4 DNA ligase is an enzyme belonging to the DNA ligase family of enzymes that catalyzes the formation of a covalent phosphodiester bond from a free 3′ hydroxyl group on one DNA molecule and a free 5′ phosphate group of a second, separate DNA molecule, thus covalently linking the two DNA strands together to form a single DNA strand. In some instances, the ligase is splintR ligase. SplintR Ligase, also known as PBCV-1 DNA Ligase or Chorella virus DNA Ligase, efficiently catalyzes the ligation of adjacent, single-stranded DNA oligonucleotides splinted by a complementary RNA strand. In some instances, the ligase is a single-stranded DNA ligase.

In some embodiments, adenosine triphosphate (ATP) is added during the ligation reaction. DNA ligase-catalyzed sealing of nicked DNA substrates is first activated through ATP hydrolysis, resulting in covalent addition of an AMP group to the enzyme. After binding to a nicked site in a DNA duplex, the ligase transfers this AMP to the phosphorylated 5′-end at the nick, forming a 5′-5′ pyrophosphate bond. Finally, the ligase catalyzes an attack on this pyrophosphate bond by the OH group at the 3′-end of the nick, thereby sealing it, whereafter ligase and AMP are released. If the ligase detaches from the substrate before the 3′ attack, e.g. because of premature AMP reloading of the enzyme, then the 5′ AMP is left at the 5′-end, blocking further ligation attempts. In some instances, ATP is added at a concentration of about 1 µM, about 10 µM, about 100 µM, about 1000 µM, or about 10000 µM during the ligation reaction.

In some instances, cofactors that aid in joining of the probe oligonucleotides are added during the ligation process. In some instances, the cofactors include magnesium ions (Mg2+). In some instances, the cofactors include manganese ions (Mn2+). In some instances, Mg2+ is added in the form of MgCl2. In some instances, Mn2+ is added in the form of MnCl2. In some instances, the concentration of MgCl2 is at about 1 mM, at about 10 mM, at about 100 mM, or at about 1000 mM. In some instances, the concentration of MnCl2 is at about 1 mM, at about 10 mM, at about 100 mM, or at about 1000 mM.

In some instances, the ligation reaction occurs at a pH in the range of about 6.5 to 9.0, of about 6.5 to 8.0, of about 7.5 to 8.0, of about 7.5, or of about 8.0.

Following the enzymatic ligation reaction, the first and second probe oligonucleotides may be considered ligated (e.g., thereby generating a ligated probe, ligated product or ligation product).

In some embodiments, the probe oligonucleotides (e.g., the first probe oligonucleotide and the second probe oligonucleotide) may each comprise a reactive moiety such that, upon hybridization to the target and exposure to appropriate ligation conditions, the probe oligonucleotides may ligate to one another. In some embodiments, probe oligonucleotide that includes a reactive moiety is a ligated chemically. For example, a probe oligonucleotide capable of hybridizing to a sequence 3′ of a target sequence (e.g., a first target region) of a nucleic acid molecule may comprise a first reactive moiety, and a probe oligonucleotide capable of hybridizing to a sequence 5′ of a target sequence (e.g., a second target region) of the nucleic acid molecule may comprise a second reactive moiety. When the first and second probe oligonucleotides are hybridized to the first and second target regions of the nucleic acid molecule, the first and second reactive moieties may be adjacent to one another. A reactive moiety of a probe may be selected from the non-limiting group consisting of azides, alkynes, nitrones (e.g., 1,3-nitrones), strained alkenes (e.g., trans-cycloalkenes such as cyclooctenes or oxanorbornadiene), tetrazines, tetrazoles, iodides, thioates (e.g., phorphorothioate), acids, amines, and phosphates. For example, the first reactive moiety of a first probe oligonucleotide may comprise an azide moiety, and a second reactive moiety of a second probe oligonucleotide may comprise an alkyne moiety. The first and second reactive moieties may react to form a linking moiety. A reaction between the first and second reactive moieties may be, for example, a cycloaddition reaction such as a strain-promoted azide-alkyne cycloaddition, a copper-catalyzed azide-alkyne cycloaddition, a strain-promoted alkyne-nitrone cycloaddition, a Diels-Alder reaction, a [3+2] cycloaddition, a [4+2] cycloaddition, or a [4+1] cycloaddition; a thiol-ene reaction; a nucleophilic substation reaction; or another reaction. In some cases, reaction between the first and second reactive moieties may yield a triazole moiety or an isoxazoline moiety. A reaction between the first and second reactive moieties may involve subjecting the reactive moieties to suitable conditions such as a suitable temperature, pH, or pressure and providing one or more reagents or catalysts for the reaction. For example, a reaction between the first and second reactive moieties may be catalyzed by a copper catalyst, a ruthenium catalyst, or a strained species such as a difluorooctyne, dibenzylcyclooctyne, or biarylazacyclooctynone. Reaction between a first reactive moiety of a first probe oligonucleotide hybridized to a first target region of the nucleic acid molecule and a second reactive moiety of a third probe oligonucleotide hybridized to a second target region of the nucleic acid molecule may link the first probe oligonucleotide and the second probe oligonucleotide to provide a ligated probe. Upon linking, the first and second probe oligonucleotides may be considered ligated. Accordingly, reaction of the first and second reactive moieties may comprise a chemical ligation reaction such as a copper-catalyzed 5′ azide to 3′ alkyne “click” chemistry reaction to form a triazole linkage between two probe oligonucleotides. In other non-limiting examples, an iodide moiety may be chemically ligated to a phosphorothioate moiety to form a phosphorothioate bond, an acid may be ligated to an amine to form an amide bond, and/or a phosphate and amine may be ligated to form a phosphoramidate bond.

In some embodiments, after ligation of the first and second probe oligonucleotides to create the ligated probe, the ligated probe is released from the analyte. In some embodiments, the ligated probe is released enzymatically. In some embodiments, an endoribonuclease is used to release the probe from the analyte. In some embodiments, the endoribonuclease is one or more of RNase H, RNase A, RNase C, or RNase I.

In some instances, the endoribonuclease is RNAse H. RNase H is an endoribonuclease that specifically hydrolyzes the phosphodiester bonds of RNA, when hybridized to DNA. The RNases H are a conserved family of ribonucleases which are present many different organisms. There are two primary classes of RNase H: RNase H1 and RNase H2. Retroviral RNase H enzymes are similar to the prokaryotic RNase H1. All of these enzymes share the characteristic that they are able to cleave the RNA component of an RNA:DNA heteroduplex. In some embodiments, the RNase H is RNase H1, RNase H2, or RNase H1, or RNase H2. In some embodiments, the RNase H includes but is not limited to RNase HII from Pyrococcus furiosus, RNase HII from Pyrococcus horikoshi, RNase HI from Thermococcus litoralis, RNase HI from Thermus thermophilus, RNAse HI from E. coli, or RNase HII from E. coli.

In some embodiments, after creating a ligated probe from the probe oligonucleotides (e.g., a first probe oligonucleotide and second probe oligonucleotide), the biological sample is permeabilized. In some embodiments, permeabilization occurs using a protease. In some embodiments, the protease is an endopeptidase. Endopeptidases that can be used include but are not limited to trypsin, chymotrypsin, elastase, thermolysin, pepsin, clostripan, glutamyl endopeptidase (GluC), ArgC, peptidyl-asp endopeptidase (ApsN), endopeptidase LysC and endopeptidase LysN. In some embodiments, the endopeptidase is pepsin.

Detailed descriptions of targeted RNA capture using RNA-templated ligation (RTL) has been disclosed in US Application No. 62/952,736, the entirety of which is incorporated herein by reference.

In some embodiments, after creating a ligated probe from the probe oligonucleotides (e.g., a first probe oligonucleotide and second probe oligonucleotide), the biological sample is permeabilized. In some embodiments, permeabilization occurs using a protease (e.g., an endopeptidase disclosed herein).

In some embodiments, the ligated probe includes a capture binding domain, which can hybridize to a capture probe (e.g., a capture probe immobilized, directly or indirectly, on a substrate), and a blocking domain, which prevents interaction between the capture binding domain and a capture domain of a capture probe. Non-limiting examples of blocking domains are described herein.

In some embodiments, the ligated probe includes a capture binding domain, which can hybridize to a capture probe (e.g., a capture probe immobilized, directly or indirectly, on a substrate), where the capture binding domain includes a plurality of caged nucleotides, wherein a caged nucleotide of the plurality of caged nucleotides includes a caged moiety that is capable of preventing interaction between the capture binding domain and the capture domain of the capture probe. Non-limiting examples of caged nucleotides, including examples of caged moieties, are described herein.

In some embodiments, methods provided herein include contacting a biological sample with a substrate, wherein the capture probe is affixed to the substrate (e.g., immobilized to the substrate, directly or indirectly). In some embodiments, the capture probe includes a spatial barcode and the capture domain. In some embodiments, prior to hybridization, the block is released from the capture binding domain of the ligated probes. In another embodiments, prior to hybridization, the caged moieties are released from the capture binding domain. The ligated probe then binds to the capture domain of the capture probe. After hybridization of the ligated probe to the capture probe, the ligated probe is analyzed (e.g., the sequence is determined) using methods disclosed herein, including but not limited to extension of the probe, RT, addition of adaptors, and sequencing.

(E) Kits

In some embodiments, also provided herein are kits that include one or more reagents to detect one or more analytes described herein. In some instances, the kit includes a substrate comprising a plurality of capture probes comprising a spatial barcode and the capture domain. In some instances, the kit includes a plurality of an analyte binding moieties as described herein. In some instances, the analyte-binding moiety of the plurality is associated with an oligonucleotide comprising an analyte-binding-moiety barcode and the capture binding domain. In some instances, the kit includes a plurality of first probe oligonucleotides and second probe oligonucleotides as described herein. In some instances, the first probe oligonucleotide and a second probe oligonucleotide each comprises sequences that are substantially complementary to an analyte, and wherein the second probe oligonucleotide comprises a capture binding domain. In some instances, the kit further includes a plurality of blocking probes as described herein.

In some embodiments, reagents can include one or more antibodies (and/or antigen-binding antibody fragments), labeled hybridization probes, and primers. For example, in some embodiments, an antibody (and/or antigen-binding antibody fragment) can be used for visualizing one or more features of a tissue sample (e.g., by using immunofluorescence or immunohistochemistry). In some embodiments, an antibody (and/or antigen-binding antibody fragment) can be an analyte binding moiety, for example, as part of an analyte capture agent. Useful commercially available antibodies will be apparent to one skilled in the art.

In some embodiments, labeled hybridization probes can be used for in situ sequencing of one or more biomarkers and/or candidate biomarkers. In some embodiments, primers can be used for amplification (e.g., clonal amplification) of a captured oligonucleotide analyte.

In some embodiments, a kit can further include instructions for performing any of the methods or steps provided herein. In some embodiments, a kit can include a substrate with one or more capture probes comprising a spatial barcode and a capture domain that binds to a biological analyte from a tissue sample, and reagents to detect a biological analyte, wherein the biological analyte is any of the biomarkers of this disclosure. In some embodiments, the kit further includes but is not limited to one or more antibodies (and/or antigen-binding antibody fragments), labeled hybridization probes, primers, or any combination thereof for visualizing one or more features of a tissue sample. 

What is claimed is:
 1. A method for identifying the location of an analyte in a biological sample comprising: (a) contacting the biological sample with a substrate comprising a plurality of capture probes comprising a spatial barcode and a capture domain; (b) binding an analyte-binding moiety to the analyte, wherein the analyte-binding moiety is associated with an oligonucleotide comprising an analyte-binding-moiety barcode and a capture binding domain that hybridizes to the capture domain of the capture probe, wherein a portion of the capture binding domain is hybridized to a blocking probe; (c) releasing the blocking probe from the capture binding domain and hybridizing the capture binding domain to the capture domain; and (d) determining (i) all or a part of the sequence of the oligonucleotide bound to the capture domain, or a complement thereof, and (ii) all or a part of the sequence of the spatial barcode, or a complement thereof, and using the determined sequence of (i) and (ii) to identify a location of the analyte in the biological sample. 